This page describes how to configure change data capture (CDC) to stream data from an Amazon Aurora PostgreSQL database to a supported destination , such as BigQuery or Cloud Storage.
Create a parameter group
- Launch your Amazon RDS Dashboard.
- In the Navigation Drawer , click Parameter Groups , and then click Create Parameter Group . The Create Parameter Group page appears.
- Use the following table to populate the fields of this page, and then click Create
:
Field Description Parameter group family Select the family that matches your database. Type Select DB Cluster Parameter Group . Group name Provide a name for the parameter group. Description Provide a description for the parameter group. - Select the checkbox to the left of your newly created parameter group, and then, under Parameter group actions , click Edit .
- Change the value of the
rds.logical_replicationparameter to 1 . - Click Save Changes .
Assign the parameter group to the database instance
- Launch your Amazon RDS Dashboard.
- In the Navigation Drawer, click Databases, and then select your database instance.
- From the Instance Actionsmenu, select Modify. The Modify DB Instancedialog appears.
- In the Additional configurationsection, select the database cluster parameter group that you created .
- Set the Backup retention periodto 7 days.
- Click Continue.
- In the Scheduling of modificationspane, select the Apply immediatelyoption.
Reboot the database instance
- Launch your Amazon RDS Dashboard.
- In the Navigation Drawer, click Databases, and then select your database instance.
- In the Actionsdrop-down menu, select Rebootand then Confirm.
Create a publication and a replication slot
-
Create a publication for the changes in the tables you want to replicate. We recommend that you create a publication only for the tables that you want to replicate. This allows Datastream to read-only the relevant data, and lowers the load on the database and Datastream:
CREATE PUBLICATION PUBLICATION_NAME FOR TABLE SCHEMA1 . TABLE1 , SCHEMA2 . TABLE2 ;
Replace the following:
- PUBLICATION_NAME : The name of your publication. You'll need to provide this name when you create a stream in the Datastream stream creation wizard.
- SCHEMA : The name of the schema that contains the table.
- TABLE : The name of the table that you want to replicate.
You can also create a publication for all tables in your database. Note that this approach increases the load on both the source database and Datastream:
CREATE PUBLICATION PUBLICATION_NAME FOR ALL TABLES;
-
Create a replication slot by entering the following PostgreSQL command:
SELECT PG_CREATE_LOGICAL_REPLICATION_SLOT(' REPLICATION_SLOT_NAME ', 'pgoutput');
Replace the following:
- REPLICATION_SLOT_NAME : The name of your replication slot. You'll need to provide this name when you create a stream in the Datastream stream creation wizard.
Create a Datastream user
-
To create a Datastream user, enter the following PostgreSQL command:
CREATE USER USER_NAME WITH ENCRYPTED PASSWORD ' USER_PASSWORD ';
Replace the following:
- USER_NAME : The name of the Datastream user that you want to create.
- USER_PASSWORD : The password for the Datastream user that you want to create.
-
Grant the following privileges to the user you created:
GRANT RDS_REPLICATION TO USER_NAME ; GRANT SELECT ON ALL TABLES IN SCHEMA SCHEMA_NAME TO USER_NAME ; GRANT USAGE ON SCHEMA SCHEMA_NAME TO USER_NAME ; ALTER DEFAULT PRIVILEGES IN SCHEMA SCHEMA_NAME GRANT SELECT ON TABLES TO USER_NAME ;
Replace the following:
- SCHEMA_NAME : The name of the schema to which you want to grant the privileges.
- USER_NAME : The user to whom you want to grant the privileges.
What's next
- Learn more about how Datastream works with PostgreSQL sources .

