Technical Article

Real-Time SQL Server to BigQuery Streaming ETL using CDC

CDC Changes: The script queries the CDC tables in SQL Server to retrieve the changes (inserts, updates, deletes) since the last sync. Each change is processed with a mapped operation type (INSERT, UPDATE, DELETE).
Real-Time Streaming to BigQuery: The captured changes are streamed directly to BigQuery using its real-time insert_rows_json method, avoiding the need for batch uploads via Google Cloud Storage.
Tracking Last Sync Time: The script tracks the last synchronization time and updates it after every successful sync, ensuring no data is missed.
Low Latency: By continuously querying the CDC tables and streaming the changes, the script achieves near real-time data synchronization.

(1)

You rated this post out of 5. Change rating

2024-11-13 (first published: )

632 reads

Blogs

Using CAT for Testing of Data Agents

By

In last months one of the scenarios where you can use AI has been...

Are you getting value from your reporting?

By

Do you spend so long manipulating your data into something vaguely useful that you...

The Book of Redgate: SQL Server Central

By

It was neat to stumble on this in the book, a piece by me,...

Read the latest Blogs

Forums

Foreach Loop still executes after process and delete all the folders

By robink

I have two challenges XML source control not displaying the XML file parent node...

Which 'Where' statement conditional upon a variable

By DaveBriCam

Thanks in advance for any clues on this. I am trying to write a...

Backup to Immutable Storage

By Steve Jones - SSC Editor

Comments posted to this topic are about the item Backup to Immutable Storage

Visit the forum

Question of the Day

Backup to Immutable Storage

In SQL Server 2025, a backup can be made on Azure Immutable Storage. What changes in how the backup is created?

See possible answers