As of April 10, 2026, Dataplex Universal Catalog is now called Knowledge Catalog. The API, client library, CLI, and IAM names remain unchanged. For more information, see Introducing the Google Cloud Knowledge Catalog .

Search multi-region lineage using server-side automation

This document describes how to look up multi-level, cross-regional data lineage by using the searchLineageStreaming API.

The searchLineageStreaming API performs a breadth-first search in a specified direction (upstream or downstream) starting from a defined set of root entities, and returns a unified lineage graph as a real-time streaming response.

Unlike standard lineage lookup APIs that might time out on massive multi-project graphs, searchLineageStreaming delivers real-time, chunked responses. Use this API when building tools that need to traverse broad, deep, or cross-regional data architectures without request timeouts.

For more information, see About multi-region lineage search .

Key capabilities

The searchLineageStreaming API includes the following capabilities:

Breadth-first search: Traverses the lineage graph layer by layer, accurately calculating the depth of each connected asset.
Streaming response: Returns subgraphs and lineage links as they are discovered by the backend system. This is highly efficient for broad or deep lineage graphs and prevents request timeouts.
Multi-location and multi-project traversal: Although you specify only one billing project in the request path, the API automatically discovers and traverses lineage links across multiple Google Cloud projects and geographical locations, provided you have the required permissions.
Fine-grained column-level lineage: Supports searching for column-level dependencies between assets.
Wildcard lookups: Lets you to retrieve all column-level lineage for a specific entity by suffixing the fully qualified name (FQN) with * .
Pipeline insights: Optionally retrieves metadata about the transformation pipelines (processes) that created the lineage links.

Before you begin

Before you make requests to the API, ensure that you have met the following security and environmental prerequisites:

Required roles

To get the permissions that you need to search for data lineage links, ask your administrator to grant you the Data Lineage Viewer ( roles/datalineage.viewer ) IAM role on the projects where the lineage links and processes are stored. For more information about granting roles, see Manage access to projects, folders, and organizations .

This predefined role contains the permissions required to search for data lineage links. To see the exact permissions that are required, expand the Required permissionssection:

Required permissions

The following permissions are required to search for data lineage links:

Search entity-level lineage: datalineage.events.get on the project where the link is stored
Search column-level lineage: datalineage.events.getFields on the project where the link is stored
Retrieve full pipeline process details: datalineage.processes.get on the project where the process is stored

You might also be able to get these permissions with custom roles or other predefined roles .

Resource scoping

When you configure your API request, you must distinguish between the resource used for administrative billing and the actual locations scanned by the API:

Billing parent path: The parent path in the URL request must use the format projects/ project /locations/ location . This specific project-location pair is used exclusively to evaluate billing quotas and API rate limits.
Target locations: Explicitly define the regions you want the backend to scan in the locations array inside the request body.

Authentication setup

Initialize an environment variable with a Google Cloud access token to authenticate your curl commands:

 export ACCESS_TOKEN=$(gcloud auth print-access-token)

Usage examples

The following examples use the endpoint datalineage.googleapis.com .

Search multi-level, multi-project lineage

To execute a deep lineage search that traverses across multiple depths of the graph and scans across distinct Google Cloud projects, define the following variables:

Set limits.maxDepth to your target traversal depth (accepts values from 1 to 100 ).
Populate the locations array with the target regions you want the backend to cross-reference (for example, ["us", "us-east1"] ).

C#

Before trying this sample, follow the C# setup instructions in the Knowledge Catalog quickstart using client libraries . For more information, see the Knowledge Catalog C# API reference documentation .

To authenticate to Knowledge Catalog, set up Application Default Credentials. For more information, see Set up authentication for a local development environment .

  using 
  
  Google.Api.Gax.Grpc 
 
 ; 
 using 
  
  Google.Api.Gax.ResourceNames 
 
 ; 
 using 
  
  Google.Cloud.DataCatalog.Lineage.V1 
 
 ; 
 using 
  
 System.Threading.Tasks 
 ; 
 public 
  
 sealed 
  
 partial 
  
 class 
  
 GeneratedLineageClientSnippets 
 { 
  
 /// <summary>Snippet for SearchLineageStreaming</summary> 
  
 /// <remarks> 
  
 /// This snippet has been automatically generated and should be regarded as a code template only. 
  
 /// It will require modifications to work: 
  
 /// - It may require correct/in-range values for request initialization. 
  
 /// - It may require specifying regional endpoints when creating the service client as shown in 
  
 ///   https://cloud.google.com/dotnet/docs/reference/help/client-configuration#endpoint. 
  
 /// </remarks> 
  
 public 
  
 async 
  
 Task 
  
 SearchLineageStreamingRequestObject 
 () 
  
 { 
  
 // Create client 
  
  LineageClient 
 
  
 lineageClient 
  
 = 
  
  LineageClient 
 
 . 
  Create 
 
 (); 
  
 // Initialize request argument(s) 
  
  SearchLineageStreamingRequest 
 
  
 request 
  
 = 
  
 new 
  
  SearchLineageStreamingRequest 
 
  
 { 
  
 ParentAsLocationName 
  
 = 
  
  LocationName 
 
 . 
  FromProjectLocation 
 
 ( 
 "[PROJECT]" 
 , 
  
 "[LOCATION]" 
 ), 
  
 Locations 
  
 = 
  
 { 
  
 "" 
 , 
  
 }, 
  
 RootCriteria 
  
 = 
  
 new 
  
 SearchLineageStreamingRequest 
 . 
 Types 
 . 
 RootCriteria 
 (), 
  
 Direction 
  
 = 
  
  SearchLineageStreamingRequest 
 
 . 
  Types 
 
 . 
  SearchDirection 
 
 . 
  Unspecified 
 
 , 
  
 Filters 
  
 = 
  
 new 
  
 SearchLineageStreamingRequest 
 . 
 Types 
 . 
 SearchFilters 
 (), 
  
 Limits 
  
 = 
  
 new 
  
 SearchLineageStreamingRequest 
 . 
 Types 
 . 
 SearchLimits 
 (), 
  
 }; 
  
 // Make the request, returning a streaming response 
  
 using 
  
 LineageClient.SearchLineageStreamingStream 
  
 response 
  
 = 
  
 lineageClient 
 . 
  SearchLineageStreaming 
 
 ( 
 request 
 ); 
  
 // Read streaming responses from server until complete 
  
 // Note that C# 8 code can use await foreach 
  
 AsyncResponseStream<SearchLineageStreamingResponse> 
  
 responseStream 
  
 = 
  
 response 
 . 
 GetResponseStream 
 (); 
  
 while 
  
 ( 
 await 
  
 responseStream 
 . 
 MoveNextAsync 
 ()) 
  
 { 
  
  SearchLineageStreamingResponse 
 
  
 responseItem 
  
 = 
  
 responseStream 
 . 
 Current 
 ; 
  
 // Do something with streamed response 
  
 } 
  
 // The response stream has completed 
  
 } 
 }

Search multi-region lineage using server-side automation

Key capabilities

Before you begin

Required roles

Required permissions

Resource scoping

Authentication setup

Usage examples

Search multi-level, multi-project lineage

C#

C#

Java

Java

Node.js

Java

Python

Python

Ruby

Ruby

REST

curl (Linux, macOS, or Cloud Shell)

PowerShell (Windows)

Search multiple geographical locations

Retrieve process names for lineage links

Retrieve full process details using a FieldMask

Search both table-level and column-level lineage

Use wildcards for column-level lineage

Filter lineage results

Filter by dependency type

Exclude column-level lineage (Table-only search)

Filter by time range

Troubleshooting: Handle unreachable locations and partial graphs

What's next

Search multi-region lineage using server-side automation Stay organized with collections Save and categorize content based on your preferences.

Key capabilities

Before you begin

Required roles

Required permissions

Resource scoping

Authentication setup

Usage examples

Search multi-level, multi-project lineage

C#

C#

Java

Java

Node.js

Java

Python

Python

Ruby

Ruby

REST

curl (Linux, macOS, or Cloud Shell)

PowerShell (Windows)

Search multiple geographical locations

Retrieve process names for lineage links

Retrieve full process details using a FieldMask

Search both table-level and column-level lineage

Use wildcards for column-level lineage

Filter lineage results

Filter by dependency type

Exclude column-level lineage (Table-only search)

Filter by time range

Troubleshooting: Handle unreachable locations and partial graphs

What's next

Search multi-region lineage using server-side automation