As of April 20th, 2026, BigLake is now called Lakehouse for Apache Iceberg. BigLake metastore is now called the Lakehouse runtime catalog. Lakehouse APIs, client libraries, CLI commands, and IAM names remain unchanged and still reference BigLake.

Configure the Lakehouse runtime catalog for Managed Service for Apache Spark using Iceberg 1.10

Configuring the custom Apache Iceberg catalog for BigQuery connects your Apache Spark and Apache Flink engines to the Lakehouse runtime catalog .

With this integration established for Lakehouse for Apache Iceberg , you create a single, shared metadata layer to manage your open table formats using either Managed Service for Apache Spark clusters or Managed Service for Apache Spark .

Before you begin

Enable billing for your Google Cloud project. Learn how to check if billing is enabled on a project .
Enable the BigQuery, and Managed Service for Apache Spark APIs.

Enable the APIs
Understand Lakehouse runtime catalog .

Required roles

To get the permissions that you need to configure the Lakehouse runtime catalog, ask your administrator to grant you the following IAM roles:

Create a Managed Service for Apache Spark cluster: Dataproc Worker ( roles/dataproc.worker ) on the Compute Engine default service account in the project
Create Lakehouse runtime catalog tables:
- Dataproc Worker ( roles/dataproc.worker ) on the Managed Service for Apache Spark VM service account in the project
- BigQuery Data Editor ( roles/bigquery.dataEditor ) on the Managed Service for Apache Spark VM service account in the project
- Storage Object User ( roles/storage.objectUser ) on the Managed Service for Apache Spark VM service account in the project
Query Lakehouse runtime catalog tables:
- BigQuery Data Viewer ( roles/bigquery.dataViewer ) on the project
- BigQuery User ( roles/bigquery.user ) on the project
- Storage Object Viewer ( roles/storage.objectViewer ) on the project

For more information about granting roles, see Manage access to projects, folders, and organizations .

You might also be able to get the required permissions through custom roles or other predefined roles .

Configure your metastore with Managed Service for Apache Spark

You can configure Lakehouse runtime catalog with Managed Service for Apache Spark using either Spark or Flink:

Spark

Configure a new cluster. To create a new Managed Service for Apache Spark cluster, run the following gcloud dataproc clusters create command , which contains the settings that you need to use Lakehouse runtime catalog:
```
gcloud  
dataproc  
clusters  
create  
 CLUSTER_NAME 
  
 \ 
  
--project = 
 PROJECT_ID 
  
 \ 
  
--region = 
 LOCATION 
  
 \ 
  
--single-node
```
Replace the following:
- CLUSTER_NAME : a name for your Managed Service for Apache Spark cluster.
- PROJECT_ID : the ID of the Google Cloud project where you're creating the cluster.
- LOCATION : the Compute Engine region where you're creating the cluster.

Submit a Spark job using one of the following methods:

Google Cloud CLI

gcloud  
dataproc  
 jobs 
  
submit  
spark-sql  
 \ 
  
--project = 
 PROJECT_ID 
  
 \ 
  
--cluster = 
 CLUSTER_NAME 
  
 \ 
  
--region = 
 REGION 
  
 \ 
  
--jars = 
https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-spark-runtime-3.5_2.12/1.10.0/iceberg-spark-runtime-3.5_2.12-1.10.0.jar,https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-bigquery/1.10.0/iceberg-bigquery-1.10.0.jar,https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-gcp-bundle/1.10.0/iceberg-gcp-bundle-1.10.0.jar,https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-gcp/1.10.0/iceberg-gcp-1.10.0.jar  
 \ 
  
--properties = 
spark.sql.catalog. CATALOG_NAME 
 = 
org.apache.iceberg.spark.SparkCatalog, \ 
  
spark.sql.catalog. CATALOG_NAME 
.type = 
bigquery, \ 
  
spark.sql.catalog. CATALOG_NAME 
.gcp.bigquery.project-id = 
 PROJECT_ID 
, \ 
  
spark.sql.catalog. CATALOG_NAME 
.gcp.bigquery.location = 
 LOCATION 
, \ 
  
spark.sql.catalog. CATALOG_NAME 
.warehouse = 
 WAREHOUSE_DIRECTORY 
  
 \ 
  
--execute = 
 " SPARK_SQL_COMMAND 
"

Replace the following:

PROJECT_ID : the ID of the Google Cloud project that contains the Managed Service for Apache Spark cluster.
CLUSTER_NAME : the name of the Managed Service for Apache Spark cluster that you're using to run the Spark SQL job.
REGION : the Compute Engine region where your cluster is located.
LOCATION : the location of the BigQuery resources.
CATALOG_NAME : the name of the Spark catalog to use with your SQL job.
WAREHOUSE_DIRECTORY : the Cloud Storage folder that contains your data warehouse. This value starts with gs:// .
SPARK_SQL_COMMAND : the Spark SQL query that you want to run. This query includes the commands to create your resources. For example, to create a namespace and table.

spark-sql CLI

In the Google Cloud console, go to the VM Instancespage.

Go to VM Instances
To connect to a Managed Service for Apache Spark VM instance, click SSHin the row that lists the Managed Service for Apache Spark cluster main VM instance name, which is the cluster name followed by an -m suffix. The output is similar to the following:
```
 Connected, host fingerprint: ssh-rsa ...
Linux cluster-1-m 3.16.0-0.bpo.4-amd64 ...
...
example-cluster@cluster-1-m:~$ 
```

In the terminal, run the following Lakehouse runtime catalog initialization command:

spark-sql  
 \ 
  
--jars  
https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-spark-runtime-3.5_2.12/1.10.0/iceberg-spark-runtime-3.5_2.12-1.10.0.jar,https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-bigquery/1.10.0/iceberg-bigquery-1.10.0.jar,https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-gcp-bundle/1.10.0/iceberg-gcp-bundle-1.10.0.jar,https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-gcp/1.10.0/iceberg-gcp-1.10.0.jar  
 \ 
  
--conf  
spark.sql.catalog. CATALOG_NAME 
 = 
org.apache.iceberg.spark.SparkCatalog  
 \ 
  
--conf  
spark.sql.catalog. CATALOG_NAME 
.type = 
bigquery  
 \ 
  
--conf  
spark.sql.catalog. CATALOG_NAME 
.gcp.bigquery.project-id = 
 PROJECT_ID 
  
 \ 
  
--conf  
spark.sql.catalog. CATALOG_NAME 
.gcp.bigquery.location = 
 LOCATION 
  
 \ 
  
--conf  
spark.sql.catalog. CATALOG_NAME 
.warehouse = 
 WAREHOUSE_DIRECTORY

Replace the following:

CATALOG_NAME : the name of the Spark catalog that you're using with your SQL job.
PROJECT_ID : the Google Cloud project ID of the Lakehouse runtime catalog that your Spark catalog links with.
LOCATION : the Google Cloud location of the Lakehouse runtime catalog.
WAREHOUSE_DIRECTORY : the Cloud Storage folder that contains your data warehouse. This value starts with gs:// .

After you successfully connect to the cluster, your Spark terminal displays the spark-sql prompt, which you can use to submit Spark jobs.

 spark-sql (default)>

Flink

Create a Managed Service for Apache Spark cluster with the optional Flink component enabled , and ensure that you're using Managed Service for Apache Spark 2.2 or later.
In the Google Cloud console, go to the VM instancespage.

Go to VM instances
In the list of virtual machine instances, click SSHto connect to the main Managed Service for Apache Spark cluster VM instance, which is listed as the cluster name followed by an -m suffix.

Configure the Apache Iceberg custom catalog plugin for Lakehouse runtime catalog:

 FLINK_VERSION 
 = 
 1 
.20 ICEBERG_VERSION 
 = 
 1 
.10.0 cd 
  
/usr/lib/flink

sudo  
wget  
-c  
https://repo.maven.apache.org/maven2/org/apache/iceberg/iceberg-flink-runtime- ${ 
 FLINK_VERSION 
 } 
/ ${ 
 ICEBERG_VERSION 
 } 
/iceberg-flink-runtime- ${ 
 FLINK_VERSION 
 } 
- ${ 
 ICEBERG_VERSION 
 } 
.jar  
-P  
lib

sudo  
wget  
-c  
https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-bigquery/ ${ 
 ICEBERG_VERSION 
 } 
/iceberg-bigquery- ${ 
 ICEBERG_VERSION 
 } 
.jar  
-P  
lib

sudo  
wget  
-c  
https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-gcp-bundle/ ${ 
 ICEBERG_VERSION 
 } 
/iceberg-gcp-bundle- ${ 
 ICEBERG_VERSION 
 } 
.jar  
-P  
lib

sudo  
wget  
-c  
https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-gcp/ ${ 
 ICEBERG_VERSION 
 } 
/iceberg-gcp- ${ 
 ICEBERG_VERSION 
 } 
.jar  
-P  
lib

Start the Flink session on YARN:

 HADOOP_CLASSPATH 
 = 
 ` 
hadoop  
classpath ` 
sudo  
bin/yarn-session.sh  
-nm  
flink-dataproc  
-d

sudo  
bin/sql-client.sh  
embedded  
 \ 
-s  
yarn-session

Create a catalog in Flink:
```
 CREATE 
  
 CATALOG 
  
  CATALOG_NAME 
 
  
 WITH 
  
 ( 
 'type' 
 = 
 'iceberg' 
 , 
 'warehouse' 
 = 
 ' WAREHOUSE_DIRECTORY 
' 
 , 
 'catalog-impl' 
 = 
 'org.apache.iceberg.gcp.bigquery.BigQueryMetastoreCatalog' 
 , 
 'gcp.bigquery.project-id' 
 = 
 ' PROJECT_ID 
' 
 , 
 'gcp.bigquery.location' 
 = 
 ' LOCATION 
' 
 ); 
```
Replace the following:
- CATALOG_NAME : the Flink catalog identifier, which is linked to a Lakehouse runtime catalog catalog.
- WAREHOUSE_DIRECTORY : the base path for the warehouse directory (the Cloud Storage folder where Flink creates files). This value starts with gs:// .
- PROJECT_ID : the project ID of the Lakehouse runtime catalog catalog that the Flink catalog links with.
- LOCATION : the location of the BigQuery resources.

Your Flink session is now connected to Lakehouse runtime catalog, and you can run Flink SQL commands.

Manage Lakehouse runtime catalog resources

Now that you're connected to Lakehouse runtime catalog, you can create and view resources based on the metadata stored in Lakehouse runtime catalog.

For example, try running the following commands in your interactive Flink SQL session to create an Apache Iceberg database and table.

Use the custom Apache Iceberg catalog:
```
 USE 
  
 CATALOG 
  
  CATALOG_NAME 
 
 ; 
```
Replace CATALOG_NAME with your Flink catalog identifier.

Create a database, which creates a dataset in BigQuery:

 CREATE 
  
 DATABASE 
  
 IF 
  
 NOT 
  
 EXISTS 
  
  DATABASE_NAME 
 
 ;

Replace DATABASE_NAME with the name of your new database.

Use the database that you created:
```
 USE 
  
  DATABASE_NAME 
 
 ; 
```

Create an Apache Iceberg table. The following creates an example sales table:

 CREATE 
  
 TABLE 
  
 IF 
  
 NOT 
  
 EXISTS 
  
  ICEBERG_TABLE_NAME 
 
  
 ( 
  
 order_number 
  
 BIGINT 
 , 
  
 price 
  
 DECIMAL 
 ( 
 32 
 , 
 2 
 ), 
  
 buyer 
  
 ROW 
 < 
 first_name 
  
 STRING 
 , 
  
 last_name 
  
 STRING 
 > 
 , 
  
 order_time 
  
 TIMESTAMP 
 ( 
 3 
 ) 
 );

Replace ICEBERG_TABLE_NAME with a name for your new table.

View table metadata:

 DESCRIBE 
  
 EXTENDED 
  
  ICEBERG_TABLE_NAME 
 
 ;

List tables in the database:
```
 SHOW 
  
 TABLES 
 ; 
```

Ingest data into your table

After you create an Apache Iceberg table in the previous section, you can use Flink DataGen as a data source to ingest real-time data into your table. The following steps are an example of this workflow:

Create a temporary table using DataGen:

 CREATE 
  
 TEMPORARY 
  
 TABLE 
  
  DATABASE_NAME 
 
 . 
  TEMP_TABLE_NAME 
 
 WITH 
  
 ( 
  
 'connector' 
  
 = 
  
 'datagen' 
 , 
  
 'rows-per-second' 
  
 = 
  
 '10' 
 , 
  
 'fields.order_number.kind' 
  
 = 
  
 'sequence' 
 , 
  
 'fields.order_number.start' 
  
 = 
  
 '1' 
 , 
  
 'fields.order_number.end' 
  
 = 
  
 '1000000' 
 , 
  
 'fields.price.min' 
  
 = 
  
 '0' 
 , 
  
 'fields.price.max' 
  
 = 
  
 '10000' 
 , 
  
 'fields.buyer.first_name.length' 
  
 = 
  
 '10' 
 , 
  
 'fields.buyer.last_name.length' 
  
 = 
  
 '10' 
 ) 
 LIKE 
  
  DATABASE_NAME 
 
 . 
  ICEBERG_TABLE_NAME 
 
  
 ( 
 EXCLUDING 
  
 ALL 
 );

Replace the following:

DATABASE_NAME : the name of the database to store your temporary table.
TEMP_TABLE_NAME : a name for your temporary table.
ICEBERG_TABLE_NAME : the name of the Apache Iceberg table that you created in the previous section.

Set the parallelism to 1:

 SET 
  
 'parallelism.default' 
  
 = 
  
 '1' 
 ;

Set the checkpoint interval:

 SET 
  
 'execution.checkpointing.interval' 
  
 = 
  
 '10second' 
 ;

Set the checkpoint:

 SET 
  
 'state.checkpoints.dir' 
  
 = 
  
 'hdfs:///flink/checkpoints' 
 ;

Start the real-time streaming job:

 INSERT 
  
 INTO 
  
  ICEBERG_TABLE_NAME 
 
  
 SELECT 
  
 * 
  
 FROM 
  
  TEMP_TABLE_NAME 
 
 ;

The output is similar to the following:

[INFO] Submitting SQL update statement to the cluster...
[INFO] SQL update statement has been successfully submitted to the cluster:
Job ID: 0de23327237ad8a811d37748acd9c10b

To check the status of the streaming job, do the following:
1. In the Google Cloud console, go to the Clusterspage.
  
  Go to Clusters
2. Select your cluster.
3. Click the Web interfacestab.
4. Click the YARN ResourceManagerlink.
5. In the YARN ResourceManagerinterface, find your Flink session, and click the ApplicationMasterlink under Tracking UI.
6. In the Statuscolumn, confirm that your job status is Running.

Query streaming data in the Flink SQL client:

 SELECT 
  
 * 
  
 FROM 
  
  ICEBERG_TABLE_NAME 
 
 /*+ OPTIONS('streaming'='true', 'monitor-interval'='3s')*/ 
 ORDER 
  
 BY 
  
 order_time 
  
 desc 
 LIMIT 
  
 20 
 ;

Query streaming data in BigQuery:

 SELECT 
  
 * 
  
 FROM 
  
 ` DATABASE_NAME 
. ICEBERG_TABLE_NAME 
` 
 ORDER 
  
 BY 
  
 order_time 
  
 desc 
 LIMIT 
  
 20 
 ;

Terminate the streaming job in the Flink SQL client:
```
 STOP 
  
 JOB 
  
 ' JOB_ID 
' 
 ; 
```
Replace JOB_ID with the job ID that was displayed in the output when you created the streaming job.

Configure your metastore with Managed Service for Apache Spark

You can configure Lakehouse runtime catalog with Managed Service for Apache Spark using either Spark SQL or PySpark.

Spark SQL

Create a SQL file with the Spark SQL commands that you want to run in Lakehouse runtime catalog. For example, this command creates a namespace and a table:

 SET 
  
 `spark.sql.catalog. CATALOG_NAME 
` 
 = 
 `org.apache.iceberg.spark.SparkCatalog` 
 ; 
 SET 
  
 `spark.sql.catalog. CATALOG_NAME 
.type` 
 = 
 `bigquery` 
 ; 
 SET 
  
 `spark.sql.catalog. CATALOG_NAME 
.gcp.bigquery.project-id` 
 = 
 ` PROJECT_ID 
` 
 ; 
 SET 
  
 `spark.sql.catalog. CATALOG_NAME 
.gcp.bigquery.location` 
 = 
 ` LOCATION 
` 
 ; 
 SET 
  
 `spark.sql.catalog. CATALOG_NAME 
.warehouse` 
 = 
 ` WAREHOUSE_DIRECTORY 
` 
 ; 
 CREATE 
  
 NAMESPACE 
  
 ` CATALOG_NAME 
` 
 . 
  NAMESPACE_NAME 
 
 ; 
 CREATE 
  
 TABLE 
  
 ` CATALOG_NAME 
` 
 . 
  NAMESPACE_NAME 
 
 . 
  TABLE_NAME 
 
  
 ( 
 id 
  
 int 
 , 
  
 data 
  
 string 
 ) 
  
 USING 
  
 ICEBERG 
  
 LOCATION 
  
 ' WAREHOUSE_DIRECTORY 
' 
 ;

Replace the following:

CATALOG_NAME : the catalog name that references your Spark table.
NAMESPACE_NAME : the namespace name that references your Spark table.
TABLE_NAME : a table name for your Spark table.
WAREHOUSE_DIRECTORY : the URI of the Cloud Storage folder where your data warehouse is stored.

Submit a Spark SQL batch job by running the following gcloud dataproc batches submit spark-sql command :

gcloud  
dataproc  
batches  
submit  
spark-sql  
 SQL_SCRIPT_PATH 
  
 \ 
  
--project = 
 PROJECT_ID 
  
 \ 
  
--region = 
 REGION 
  
 \ 
  
--subnet = 
projects/ PROJECT_ID 
/regions/ REGION 
/subnetworks/ SUBNET_NAME 
  
 \ 
  
--deps-bucket = 
 BUCKET_PATH 
  
 \ 
  
--jars = 
https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-spark-runtime-3.5_2.12/1.10.0/iceberg-spark-runtime-3.5_2.12-1.10.0.jar,https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-bigquery/1.10.0/iceberg-bigquery-1.10.0.jar,https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-gcp-bundle/1.10.0/iceberg-gcp-bundle-1.10.0.jar,https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-gcp/1.10.0/iceberg-gcp-1.10.0.jar

Replace the following:

SQL_SCRIPT_PATH : the path to the SQL file that the batch job uses.
PROJECT_ID : the ID of the Google Cloud project to run the batch job in.
REGION : the region where your workload runs.
SUBNET_NAME (optional): the name of a VPC subnet in the REGION that meets the session subnet requirements .
BUCKET_PATH : the location of the Cloud Storage bucket to upload workload dependencies. The WAREHOUSE_DIRECTORY is located in this bucket. The gs:// URI prefix of the bucket is not required. You can specify the bucket path or bucket name, for example, mybucketname1 .
LOCATION : the location to run the batch job in.

For more information on submitting Spark batch jobs, see Run a Spark batch workload .

PySpark

Create a Python file with the PySpark commands that you want to run in Lakehouse runtime catalog.

For example, the following command sets up a Spark environment to interact with Apache Iceberg tables stored in Lakehouse runtime catalog. The command then creates a new namespace and an Apache Iceberg table within that namespace.

 from 
  
 pyspark 
 . 
 sql 
  
 import 
  
 SparkSession 
 spark 
  
 = 
  
 SparkSession 
 . 
 builder 
  
 \ 
 . 
 appName 
 ( 
 "Lakehouse runtime catalog Iceberg" 
 ) 
  
 \ 
 . 
 config 
 ( 
 "spark.sql.catalog. CATALOG_NAME 
" 
 , 
  
 "org.apache.iceberg.spark.SparkCatalog" 
 ) 
  
 \ 
 . 
 config 
 ( 
 "spark.sql.catalog. CATALOG_NAME 
.type" 
 , 
  
 "bigquery" 
 ) 
  
 \ 
 . 
 config 
 ( 
 "spark.sql.catalog. CATALOG_NAME 
.gcp.bigquery.project-id" 
 , 
  
 " PROJECT_ID 
" 
 ) 
  
 \ 
 . 
 config 
 ( 
 "spark.sql.catalog. CATALOG_NAME 
.gcp.bigquery.location" 
 , 
  
 " LOCATION 
" 
 ) 
  
 \ 
 . 
 config 
 ( 
 "spark.sql.catalog. CATALOG_NAME 
.warehouse" 
 , 
  
 " WAREHOUSE_DIRECTORY 
" 
 ) 
  
 \ 
 . 
 getOrCreate 
 () 
 spark 
 . 
 sql 
 ( 
 "USE ` CATALOG_NAME 
`;" 
 ) 
 spark 
 . 
 sql 
 ( 
 "CREATE NAMESPACE IF NOT EXISTS NAMESPACE_NAME 
;" 
 ) 
 spark 
 . 
 sql 
 ( 
 "USE NAMESPACE_NAME 
;" 
 ) 
 spark 
 . 
 sql 
 ( 
 "CREATE TABLE TABLE_NAME 
(id int, data string) USING ICEBERG LOCATION ' WAREHOUSE_DIRECTORY 
';" 
 )

Replace the following:

PROJECT_ID : the ID of the Google Cloud project to run the batch job in.
LOCATION : the location where the BigQuery resources are located.
CATALOG_NAME : the catalog name that references your Spark table.
TABLE_NAME : a table name for your Spark table.
WAREHOUSE_DIRECTORY : the URI of the Cloud Storage folder where your data warehouse is stored.
NAMESPACE_NAME : the namespace name that references your Spark table.

Submit the batch job using the following gcloud dataproc batches submit pyspark command :

gcloud  
dataproc  
batches  
submit  
pyspark  
 PYTHON_SCRIPT_PATH 
  
 \ 
  
--version = 
 2 
.2  
 \ 
  
--project = 
 PROJECT_ID 
  
 \ 
  
--region = 
 REGION 
  
 \ 
  
--subnet = 
projects/ PROJECT_ID 
/regions/ REGION 
/subnetworks/ SUBNET_NAME 
  
 \ 
  
--deps-bucket = 
 BUCKET_PATH 
  
 \ 
  
--jars = 
https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-spark-runtime-3.5_2.12/1.10.0/iceberg-spark-runtime-3.5_2.12-1.10.0.jar,https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-bigquery/1.10.0/iceberg-bigquery-1.10.0.jar,https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-gcp-bundle/1.10.0/iceberg-gcp-bundle-1.10.0.jar,https://storage-download.googleapis.com/maven-central/maven2/org/apache/iceberg/iceberg-gcp/1.10.0/iceberg-gcp-1.10.0.jar

Replace the following:

PYTHON_SCRIPT_PATH : the path to the Python script that the batch job uses.
PROJECT_ID : the ID of the Google Cloud project to run the batch job in.
REGION : the region where your workload runs.
BUCKET_PATH : the location of the Cloud Storage bucket to upload workload dependencies. The gs:// URI prefix of the bucket is not required. You can specify the bucket path or bucket name, for example, mybucketname1 .

For more information on submitting PySpark batch jobs, see the PySpark gcloud reference .

Configure the Lakehouse runtime catalog for Managed Service for Apache Spark using Iceberg 1.10 Stay organized with collections Save and categorize content based on your preferences.

Before you begin

Required roles

Configure your metastore with Managed Service for Apache Spark

Spark

Google Cloud CLI

spark-sql CLI

Flink

Manage Lakehouse runtime catalog resources

Ingest data into your table

Configure your metastore with Managed Service for Apache Spark

Spark SQL

PySpark

What's next

Configure the Lakehouse runtime catalog for Managed Service for Apache Spark using Iceberg 1.10