Introduction to BigQuery row-level security
This document explains the concept of row-level security, how it works in BigQuery, when to use row-level security to secure your data, and other details.
What is row-level security?
Row-level security lets you filter data and enables access to specific rows in a table based on qualifying user conditions.
BigQuery supports access controls at the project, dataset, and table levels, as well as column-level security through policy tags. Row-level security extends the principle of least privilege by enabling fine-grained access control to a subset of data in a BigQuery table, by means of row-level access policies.
One table can have multiple row-level access policies. Row-level access policies can coexist on a table with column-level security as well as dataset-level, table-level, and project-level access controls.
How row-level security works
At a high level, row-level security involves the creation of row-level access policies on a target BigQuery table. These policies act as filters to hide or display certain rows of data, depending on whether a user or group is in an allowed list. Any users or groups not specifically included in the allowed list are denied access.
An authorized user, with the Identity and Access Management (IAM) roles BigQuery Admin or BigQuery DataOwner, can create row-level access policies on a BigQuery table.
When you create a row-level access policy, you specify the table by name, and
which users or groups (called the grantee-list
) can access certain
row data. The policy also includes the data on which you want to filter, called
the filter_expression
. The filter_expression
functions like a WHERE
clause in a typical query.
For instructions on how to create and use a row-level access policy, see Working with row-level security.
See the DDL reference for the complete syntax, usage, and options when creating row-level access policies.
Example use cases
The following examples demonstrate potential use cases for row-level security.
Filter row data based on region
Consider the case where the table dataset1.table1
contains rows belonging to
different regions (denoted by the region
column).
You can create and populate the example table by using the following query:
CREATE TABLE IF NOT EXISTS dataset1.table1 (partner STRING, contact STRING, country STRING, region STRING); INSERT INTO dataset1.table1 (partner, contact, country, region) VALUES ('Example Customers Corp', 'alice@examplecustomers.com', 'Japan', 'APAC'), ('Example Enterprise Group', 'bob@exampleenterprisegroup.com', 'Singapore', 'APAC'), ('Example HighTouch Co.', 'carrie@examplehightouch.com', 'USA', 'US'), ('Example Buyers Inc.', 'david@examplebuyersinc.com', 'USA', 'US');
Row-level security lets a data owner or administrator implement policies. The following statement implements a policy that restricts users in the APAC mailing group to see only partners from the APAC region:
CREATE ROW ACCESS POLICY apac_filter ON dataset1.table1 GRANT TO ("group: sales-apac@example.com") FILTER USING (region="APAC" );
The resulting behavior is that users in the sales-apac@example.com
group can
view only rows where the value for region
is APAC
.
The following statement implements a policy that restricts both individuals and groups to see only partners from the US region:
CREATE ROW ACCESS POLICY us_filter ON dataset1.table1 GRANT TO ("group:sales-us@example.com", "user: jon@example.com") FILTER USING (region="US");
The resulting behavior is that users in the group sales-us@example.com
and the
user jon@example.com
can view only rows where the value for region
is US
.
The following image shows how the previous two access policies restrict which users and groups can view which rows in the table:
Users that aren't in the APAC
or US
groups don't see any rows.
Filter row data based on sensitive data
Now, consider a different use case, where you have a table that contains salary information.
You can create and populate the example table by using the following query:
CREATE OR REPLACE TABLE dataset1.table1 (name STRING, department STRING, salary INT64, email STRING); INSERT INTO dataset1.table1 ( name, department, salary, email) VALUES ('Jim D', 'HR', 100000, 'jim@example.com'), ('Anna K', 'Finance', 100000, 'anna@example.com'), ('Bruce L', 'Engineering', 100000, 'bruce@example.com'), ('Carrie F', 'Business', 100000, 'carrie@example.com');
The row access policy in the following statement restricts querying to members
of the company domain. In addition, the use of the SESSION_USER()
function
restricts access only to rows that belong to the user running the query, based
on their user email address.
CREATE ROW ACCESS POLICY salary_personal ON dataset1.table1 GRANT TO ("domain:example.com") FILTER USING (Email=SESSION_USER());
The following image demonstrates how the row access policy restricts the table containing salary
information. In this example, the user is named Jim, with the email address jim@example.com
.
Filter row data based on lookup table
To provide feedback or request support with this feature, send email to bigquery-row-level-security-support@google.com.With subquery support, row access policies can reference other tables and use them as lookup tables. Data used in filtering rules can be stored in a table and a single subquery row access policy can replace multiple configured row access policies. To update the row access policies, you only need to update the lookup table, which replaces multiple row access policies. You don't need to update each individual row access policy.
When to use row-level security versus other methods
Authorized views, row-level access policies, and storing data in separate tables all provide different levels of security, performance, and convenience. Choosing the right mechanism for your use case is important to ensure the proper level of security for your data.
Comparison with authorized views: vulnerabilities
Both row-level security and enforcing row-level access with an authorized view can have vulnerabilities, if used improperly.
When you use either authorized views or row-level access policies for row-level security, we recommend that you monitor for any suspicious activity using audit logging.
Side channels, such as the query duration, can leak information about rows that are at the edge of a storage shard. Such attacks would likely require either some knowledge of how the table is sharded, or a large number of queries.
For more information about preventing such side-channel attacks, see Best practices for row-level security.
Comparison of authorized views, row-level security, and separate tables
The following table compares the flexibility, performance, and security of authorized views, row-level access policies, and separate tables.
Method | Security considerations | Recommendation |
---|---|---|
Authorized views |
Recommended for flexibility. Can be vulnerable to carefully crafted queries, query durations, and other types of side-channel attacks. | Authorized views are a good choice when you need to share data with others and flexibility and performance are important. For example, you can use authorized views to share data within your work group. |
Row-level access policies | Recommended for a balance of flexibility and security. Can be vulnerable to query duration side-channel attacks. | Row-level access policies are a good choice when you need to share data with others and you want to provide additional security over views or table slices. For example, you can use row-level access policies to share data with people who all use the same dashboard, even if some people have access to more data than others. |
Separate tables | Recommended for security. Users cannot infer data without access to the table. | Separate tables are a good choice when you need to share data with others and you need to keep data isolated. For example, you can use separate tables to share data with third-party partners and vendors, when the total number of rows must be secret. |
Create and manage row-level access policies
For information about how to create, update (re-create), list, view, and delete row-level access policies on a table, and how to query tables with row-level access policies, see Working with row-level access security.
Quotas
For more information about quotas and limits for row-level security, see BigQuery Quotas and limits.
Pricing
Row-level security is included with BigQuery at no additional cost. However, a row-level access policy can affect the cost of running a query in the following ways:
Additional billing can be caused by row-level access policies, specifically policies that include subqueries that reference other tables.
Row-level access policy filters don't participate in query pruning on partitioned and clustered tables. This does not mean it reads more data during the main query execution. It doesn't take advantage of row access policy predicates to prune any further.
With row-level access policy filters, not all user filters are applied early. This might increase the data read from tables and might read and bill for more rows.
For more information about BigQuery query pricing, see BigQuery pricing.
Limitations
For information about limits for row-level security, see BigQuery Row-level security limits. The following sections document additional row-level security limitations.
Performance limitations
Some BigQuery features aren't accelerated when working with tables containing row-level access policies, such as BigQuery BI Engine and materialized views.
Row-level security does not participate in query pruning, which is a feature of partitioned tables. For more information, see Partitioned and clustered tables. This limitation doesn't slow down the main query execution.
You might experience a small performance degradation when you query tables with row-level security.
For more information about how row-level security interacts with some BigQuery features and services, see Using row-level security with other BigQuery features.
Other limitations
This feature may not be available when using reservations that are created with certain BigQuery editions. For more information about which features are enabled in each edition, see Introduction to BigQuery editions.
Row access policies are not compatible with Legacy SQL. Queries of tables with row-level access policies must use GoogleSQL. Legacy SQL queries are rejected with an error.
You cannot apply row-level access policies on JSON columns.
Some features of BigQuery are not compatible with row-level security. For more information, see Using row-level security.
Non-query operations, including service account jobs, that need full access to table data can use row-level security with the
TRUE
filter. Examples include table copying, dataproc workflows, and more. For more information, see Using row-level security.Creating, replacing, or deleting row-level access policies must be performed with DDL statements. Listing and viewing row-level access policies can be performed through the Google Cloud console or the bq command-line tool.
Previewing or browsing tables is incompatible with row-level security.
Table sampling is not compatible with row-level security.
Top-level subquery policy results are limited to 100 MB. This limit applies per row-level access policy.
If the row-level access policy predicate cannot be evaluated due to the deletion of any referenced table, the query fails.
Subquery row-level access policies only support BigQuery tables, BigLake external tables, and BigLake managed tables.
Audit logging and monitoring
When data in a table with one or more row-level access policies is read, the row-level access policies authorized for the read access and any corresponding tables referenced in subqueries appear in the IAM authorization information for that read request.
Creation and deletion of row-level access policies are audit logged, and can be
accessed through Cloud Logging. Audit logs
include the name of the row-level access policy. However, the
filter_expression
and grantee_list
definitions of a row-level access
policy are omitted from logs, as they may contain user or other sensitive
information. Listing and viewing of row-level access policies are not audit
logged.
For more information about logging in BigQuery, see Introduction to BigQuery monitoring.
For more information about logging in Google Cloud, see Cloud Logging.
What's next
For information about managing row-level security, see Use row-level security.
For information about how row-level security works with other BigQuery features and services, see Using row-level security with other BigQuery features.
For information about best practices for row-level security, see Best Practices for row-level security in BigQuery.