This page describes how to configure change data capture (CDC) to stream data from a self-managed PostgreSQL database to a supported destination , such as BigQuery or Cloud Storage.
Enable logical replication on the database
-
Set the
wal_levelparameter in your database by addingwal_level=logicalto thepostgresql.conffile. -
Restart the server.
Create a publication and a replication slot
-
Sign in to a PostgreSQL console as a superuser.
-
Create a publication. We recommend that you create a publication only for the tables that you want to replicate. This allows Datastream to read-only the relevant data, and lowers the load on the database and Datastream:
CREATE PUBLICATION PUBLICATION_NAME FOR TABLE SCHEMA1 . TABLE1 , SCHEMA2 . TABLE2 ;
Replace the following:
- PUBLICATION_NAME : The name of your publication. You'll need to provide this name when you create a stream in the Datastream stream creation wizard.
- SCHEMA : The name of the schema that contains the table.
- TABLE : The name of the table that you want to replicate.
You can create a publication for all tables in a schema. This approach lets you replicate changes for tables in the specified list of schemas, including tables that you create in the future:
CREATE PUBLICATION PUBLICATION_NAME FOR TABLES IN SCHEMA SCHEMA1 , SCHEMA2 ;
You can also create a publication for all tables in your database. Note that this approach increases the load on both the source database and Datastream:
CREATE PUBLICATION PUBLICATION_NAME FOR ALL TABLES;
-
Create a replication slot by entering the following SQL command:
SELECT PG_CREATE_LOGICAL_REPLICATION_SLOT(' REPLICATION_SLOT_NAME ', 'pgoutput');
Replace the following:
- REPLICATION_SLOT_NAME : The name of your replication slot. You will need to provide this name when you create a stream in the Datastream stream creation wizard.
Create a Datastream user
-
To create a Datastream user, enter the following SQL commands:
CREATE USER USER_NAME WITH ENCRYPTED PASSWORD ' USER_PASSWORD ';
Replace the following:
- USER_NAME : The name of the Datastream user that you want to create.
- USER_PASSWORD : The login password for the Datastream user that you want to create.
-
Grant the following privileges to the user you created:
ALTER ROLE USER_NAME WITH REPLICATION; GRANT SELECT ON ALL TABLES IN SCHEMA SCHEMA_NAME TO USER_NAME ; GRANT USAGE ON SCHEMA SCHEMA_NAME TO USER_NAME ; ALTER DEFAULT PRIVILEGES IN SCHEMA SCHEMA_NAME GRANT SELECT ON TABLES TO USER_NAME ;
Replace the following:
- USER_NAME : The user to whom you want to grant the privileges.
- SCHEMA_NAME : The name of the schema to which you want to grant the privileges.
What's next
- Learn more about how Datastream works with PostgreSQL sources .

