This document describes how to search for resources, such as BigQuery datasets and Cloud SQL instances, in Dataplex Universal Catalog.
Dataplex Universal Catalog offers two search modes: keyword search and semantic search (Preview).
Keyword search lets you find resources using specific keywords, filters, and a defined syntax.
Semantic search extends keyword search to support natural language queries. It lets you find resources using everyday language, eliminating the need for complex syntax.
For more information about the Google Cloud assets that are supported in Dataplex Universal Catalog, see Supported Google Cloud sources.
Search scope
The search results in Dataplex Universal Catalog respect permissions that you have over the corresponding resources in source systems.
For example, if you have BigQuery metadata read access to an object, that object appears in your Dataplex Universal Catalog search results. If you have access to a BigQuery table but not to the dataset containing that table, the table still shows up as expected in the Dataplex Universal Catalog search.
By default, search is scoped to your organization. Results include only resources from the same organization as the project you're searching in.
The search results include only those resources that belong to the same VPC Service Controls (VPC-SC) perimeter as the project under which the search is performed. When you use the Google Cloud console, this is the project that is selected in the console.
To broaden the scope of your search results beyond the resources within your project's VPC Service Controls perimeter, use VPC Service Controls ingress and egress rules. These rules facilitate private and efficient data exchange across your organization. You can configure ingress and egress rules using the Google Cloud console or through JSON or YAML files. Refer to the following YAML example and consult the VPC Service Controls documentation to tailor the rule to your specific requirements.
egressPolicies:
- egressFrom:
    identityType: ANY_USER_ACCOUNT
  egressTo:
    # Specify which resources should be present in the search results. In this example,
    # BigQuery.
    operations:
    - methodSelectors:
      - method: '*'
      serviceName: bigquery.googleapis.com
    # Specify project ids under which the search is performed.
    resources:
    - projects/SEARCH_PROJECT_ID
ingressPolicies:
- ingressFrom:
    identityType: ANY_USER_ACCOUNT
    sources:
    - accessLevel: '*'
  ingressTo:
    # Specify which resources should be present in the search results. In this example,
    # BigQuery.
    operations:
    - methodSelectors:
      - method: '*'
      serviceName: bigquery.googleapis.com
    # Specify project ids to expose in search results.
    resources:
    - projects/INGRESS_PROJECT_ID
For more information about the Identity and Access Management roles that you need to use Dataplex Universal Catalog search, see Dataplex Universal Catalog IAM roles.
Isolate search results by environment using VPC Service Controls
To isolate Dataplex Universal Catalog search results between environments like development, test, and production, configure separate VPC Service Controls perimeters for each environment. Assign both the projects that contain the data assets and the projects that are used for performing searches to the corresponding environment's perimeter. Searches that are performed from a project within a specific perimeter will only return results for assets that are also located within that same perimeter.
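The isolation rule above can be illustrated with a small model. This is a minimal sketch, not how the service is implemented: the project names, perimeter names, and the `visible_assets` helper are all hypothetical, and the model ignores IAM permissions as well as any ingress and egress rules.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Asset:
    name: str
    project: str


def visible_assets(search_project, assets, perimeter_of):
    """Return the assets whose project is in the same perimeter as the search project.

    perimeter_of maps a project ID to a perimeter name (hypothetical data).
    Projects missing from the map are treated as sharing the "no perimeter" group.
    """
    search_perimeter = perimeter_of.get(search_project)
    return [a for a in assets if perimeter_of.get(a.project) == search_perimeter]


# Hypothetical layout: one perimeter per environment, each holding both the
# data project and the project used for searching.
PERIMETER_OF = {
    "dev-data": "dev",
    "dev-search": "dev",
    "prod-data": "prod",
    "prod-search": "prod",
}
ASSETS = [Asset("orders_dev", "dev-data"), Asset("orders_prod", "prod-data")]

dev_results = visible_assets("dev-search", ASSETS, PERIMETER_OF)
print([a.name for a in dev_results])  # ['orders_dev']
```

Searching from `prod-search` in the same model returns only `orders_prod`, which is the separation between environments that the perimeters are meant to provide.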
Recall limitations in search
Dataplex Universal Catalog search queries don't guarantee full recall. Results that match your query might not be returned. Additionally, the set of returned results can vary if you repeat a search query.
To query all Dataplex Universal Catalog metadata, you can export the metadata to Cloud Storage and then query it from BigQuery. For more information, see Export metadata.
Filters
Filters let you narrow down the search results. Filters are available in both keyword and semantic search, but the available options might differ slightly depending on the mode.
When you provide filters in multiple sections, they are evaluated using the AND logical operator. The search results contain resources that match at least one condition from every selected section. For example, if you select the BigQuery system and the dataset resource type, the search results include BigQuery datasets but not Vertex AI datasets.
If you select multiple filters within a single section, they are evaluated using the OR logical operator. For example, if you select the dataset resource type and the table resource type, the search results include both datasets and tables.
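The way filters combine can be sketched in a few lines of Python. This is only an illustration of the AND-across-sections, OR-within-a-section rule described above; the `matches` helper, the section names, and the sample resources are hypothetical and not part of any Dataplex API.

```python
def matches(resource, selected_filters):
    """Return True if the resource satisfies every section that has a selection.

    selected_filters maps a section name to the set of values selected in it.
    Within a section, any selected value may match (OR).
    Across sections, every section must match (AND).
    """
    return all(
        resource.get(section) in values
        for section, values in selected_filters.items()
        if values  # a section with nothing selected imposes no constraint
    )


# Hypothetical catalog entries.
RESOURCES = [
    {"name": "sales", "system": "BigQuery", "type": "dataset"},
    {"name": "sales.orders", "system": "BigQuery", "type": "table"},
    {"name": "images", "system": "Vertex AI", "type": "dataset"},
]

# Two sections selected: BigQuery AND dataset.
both = {"system": {"BigQuery"}, "type": {"dataset"}}
print([r["name"] for r in RESOURCES if matches(r, both)])  # ['sales']

# Two values in one section: dataset OR table.
either = {"type": {"dataset", "table"}}
print([r["name"] for r in RESOURCES if matches(r, either)])
# ['sales', 'sales.orders', 'images']
```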
Keyword search filters
For keyword search, filters are grouped into the following sections:
- Systems such as BigQuery, Cloud SQL, and others. The Dataplex Universal Catalog system contains custom entries.
- Aspects (tags) list all aspects available to you.
- Project lists all projects available to you.
- Type aliases describe resource types, such as databases, datasets, models, tables, views, services, and custom types.
- Datasets come from BigQuery.
Filter by aspect value
The Aspects filters let you query for assets tagged using a specific template. You can use the Customize menu to further refine results and filter by specific aspect values. The aspect value filter conditions depend on that aspect field's data type. For example, for the datetime and number fields, you can specify a specific date or a range.
Filter visibility
The filters Systems, Type aliases, Project, and Datasets are displayed depending on the current query in the Search field.
Semantic search filters
For semantic search, the following filters are available:
- Scope: search across the organization (default), the current project, or only for starred resources. For more information, see the Search scope section of this document.
- Systems: the Google Cloud service that the resource belongs to, such as BigQuery. The Dataplex Universal Catalog system contains entry groups.
- Projects: the projects to search in.
- Type: the resource type, such as BigQuery connection, Cloud Storage bucket, or database. Depending on the resource type, you can also filter by subtype, such as the connection type or SQL dialect.
- Select locations: the locations to search in.
- Select datasets: the search results are limited to BigQuery resources that belong to the selected BigQuery datasets. In the Type to filter field, enter the name of the dataset.
- Aspect types: the Dataplex Universal Catalog aspect types that are associated with the resource that you're searching for. To filter by aspect values, click Filter on annotation values, and then select the values.
Before you begin
Before you search, make sure that you have been granted the required roles and that the necessary API is enabled.
Required roles
This section describes the roles and permissions required to search for resources and to access the search results.
For more information about granting roles, see Manage access.
You might also be able to get the required permissions through custom roles or other predefined roles.
Required roles for searching entries
To search for entries, you need at least one of the following IAM roles on the project that is used for search: Dataplex Catalog Admin, Dataplex Catalog Editor, or Dataplex Catalog Viewer. Permissions on search results are checked independently of the selected project.
Required roles for accessing search results
The search results in Dataplex Universal Catalog are scoped according to your role. To search for an asset in Dataplex Universal Catalog, you must have permissions to access the corresponding resource in the source system. For more information, see the Search scope section of this document.
For example, to search for BigQuery datasets, tables, views, and models, you need respective permissions for those entries. For more information, see BigQuery permissions.
The following list describes the minimum permissions required:
- To search for a table, you need bigquery.tables.get permission for that table.
- To search for a dataset, you need bigquery.datasets.get permission for that dataset.
- To search for metadata for a dataset or a table, you need the BigQuery Metadata Viewer role (roles/bigquery.metadataViewer).
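The minimum BigQuery permissions in the list above can be read as a simple lookup. The following sketch is a simplified model: `appears_in_search` and the `granted` set of (permission, resource) pairs are hypothetical, and the model does not account for IAM role inheritance or for the search-time role required on the search project.

```python
# Minimum permission per entry type, taken from the list above.
REQUIRED_PERMISSION = {
    "table": "bigquery.tables.get",
    "dataset": "bigquery.datasets.get",
}


def appears_in_search(entry_type, entry_name, granted):
    """Return True if the caller holds the minimum permission on this exact entry.

    granted is a set of (permission, resource_name) pairs (hypothetical data).
    Entry types not listed in REQUIRED_PERMISSION are treated as not visible.
    """
    needed = REQUIRED_PERMISSION.get(entry_type)
    return needed is not None and (needed, entry_name) in granted


# The caller can read one table but not the dataset that contains it.
granted = {("bigquery.tables.get", "sales.orders")}

print(appears_in_search("table", "sales.orders", granted))  # True
print(appears_in_search("dataset", "sales", granted))       # False
```

This mirrors the behavior described in the Search scope section: access to a table is enough for that table to appear in the results, even without access to the containing dataset.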
As another example, to search for Cloud SQL instances, databases, schemas, tables, and views, you need respective permissions on those entries. For more information, see Cloud SQL roles and permissions.
To search for custom entries, you need the Dataplex Catalog Viewer role (roles/dataplex.catalogViewer).
Enable the API
Enable the Dataplex API.
Search for resources
Console
To search for resources, follow these steps:
- In the Google Cloud console, go to the Dataplex Universal Catalog Search page.
- For Choose search platform, select Dataplex Universal Catalog.
  Selecting Dataplex Universal Catalog lets you search the Dataplex Universal Catalog metadata storage. Selecting Data Catalog lets you search your Data Catalog repository, if you're an existing Data Catalog user.
- To use keyword search, follow these steps:
  - In the Find resources across projects field, enter your query.
  - To refine your search, use the Filters panel. For the list of available filters, see Keyword search filters.
    You can manually add the following filters:
    - Add a project filter: in Project, click Add project. Search for a specific project, select the project, and then click Open.
    - Add an aspect type filter: in Aspects, click the Add more aspect types menu. Search for a specific template, select it, and then click OK.
  - Optional: In addition to the assets available to you, you can search for resources that are publicly available in Google Cloud by selecting Include public datasets.
  Use the following tips to construct a search query:
  - Enclose your search expression in quotes if it contains spaces. For example, "search terms".
  - Precede a keyword with NOT to match the logical negation of the keyword:term filter. You can also use AND and OR Boolean operators to combine search expressions. The AND, OR, and NOT operators aren't case-sensitive.
    For example, NOT column:term lists all columns except those that match the specified term. For a list of keywords and other terms you can use in a Dataplex Universal Catalog search expression, see Search syntax.
- To use semantic search, follow these steps:
  Preview
  This feature is subject to the "Pre-GA Offerings Terms" in the General Service Terms section of the Service Specific Terms. Pre-GA features are available "as is" and might have limited support. For more information, see the launch stage descriptions.
  - Click Try semantic search.
  - In the Find resources across projects with semantic field, enter your query in natural language and then press Enter.
    The following are some sample queries:
    - Show me the datasets that contain taxi information
    - Find data on vaccine distribution across different countries
    - Get tables with historical temperature data for major world cities
    - Search for hurricane tracking and storm activity datasets
    - Population data by country
  - To refine your search, click Filters. For the list of available filters, see Semantic search filters.
- To view more information about the searched resource, in the search results, click the resource name. This opens the entry details page.
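The query-construction tips in the keyword search steps above can be sketched as a few string helpers. This is an illustrative sketch only: `quote_term`, `negate`, and `combine` are hypothetical helper names, `column:fare` is a made-up filter value, and the helpers don't escape quotes that already appear inside a term.

```python
def quote_term(term):
    """Wrap the term in double quotes if it contains whitespace."""
    return f'"{term}"' if any(ch.isspace() for ch in term) else term


def negate(expression):
    """Prefix an expression with NOT to match its logical negation."""
    return f"NOT {expression}"


def combine(operator, *expressions):
    """Join expressions with AND or OR.

    The operators aren't case-sensitive in search; they are upper-cased here
    only for readability.
    """
    operator = operator.upper()
    if operator not in {"AND", "OR"}:
        raise ValueError(f"unsupported operator: {operator}")
    return f" {operator} ".join(expressions)


query = combine("and", quote_term("taxi trips"), negate("column:fare"))
print(query)  # "taxi trips" AND NOT column:fare
```

The resulting string is what you would enter in the search field, or pass as the query when using the gcloud CLI or the REST API.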
gcloud
To search for resources, use the gcloud dataplex entries search command.
REST
To search for resources, use the searchEntries method.
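As a rough illustration of what a searchEntries call looks like, the following sketch only builds the request URL and does not send it. The URL shape (a POST to `projects/PROJECT_ID/locations/global:searchEntries` with `query` and `pageSize` as query parameters) is an assumption based on the public REST reference and should be verified there before use; `search_entries_url` and `my-project` are hypothetical names.

```python
from urllib.parse import urlencode

# Assumed base URL of the Dataplex REST API; verify against the API reference.
BASE_URL = "https://dataplex.googleapis.com/v1"


def search_entries_url(project_id, query, location="global", page_size=None):
    """Build the URL for a searchEntries request (assumed to be sent as a POST).

    The parameter names `query` and `pageSize` are assumptions taken from the
    REST reference. Authentication headers are out of scope for this sketch.
    """
    name = f"projects/{project_id}/locations/{location}"
    params = {"query": query}
    if page_size is not None:
        params["pageSize"] = page_size
    return f"{BASE_URL}/{name}:searchEntries?{urlencode(params)}"


print(search_entries_url("my-project", "taxi"))
# https://dataplex.googleapis.com/v1/projects/my-project/locations/global:searchEntries?query=taxi
```

Keeping URL construction separate from sending the request makes it easy to inspect how a query with spaces or colons is encoded before any call is made.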
View details of an entry
Console
Use Dataplex Universal Catalog search to view the details of an entry.
- Search for an entry in Dataplex Universal Catalog.
- In the search results, click the entry for which you want to view the details.
  The entry details page opens. The page includes the following sections:
  - Entry details: includes information such as the entry type, system, platform, fully qualified name, creation time, last modification time, description, and stewards.
  - Overview: an overview of the entry, if available.
  - Aspects: the required and optional aspects defined for the entry. For more information, see Categories of aspects.
gcloud
To view the details of an entry, use the gcloud dataplex entries lookup command.
REST
To view the details of an entry, use the lookupEntry method.
What's next
- Understand search syntax for Dataplex Universal Catalog.
- Learn more about metadata management in Dataplex Universal Catalog.
- Learn how to enrich entries with metadata using aspects.
- Learn how to manage entries and ingest custom sources.