ETL/SSIS/Azure Data Factory

Technical Article

Real-Time SQL Server to BigQuery Streaming ETL using CDC

  • Script

CDC Changes: The script queries the CDC tables in SQL Server to retrieve the changes (inserts, updates, deletes) since the last sync. Each change is processed with a mapped operation type (INSERT, UPDATE, DELETE).
Real-Time Streaming to BigQuery: The captured changes are streamed directly to BigQuery using its real-time insert_rows_json method, avoiding the need for batch uploads via Google Cloud Storage.
Tracking Last Sync Time: The script tracks the last synchronization time and updates it after every successful sync, ensuring no data is missed.
Low Latency: By continuously querying the CDC tables and streaming the changes, the script achieves near real-time data synchronization.

(1)

You rated this post out of 5. Change rating

2024-11-13 (first published: )

606 reads

Blogs

I recommend Smart Brevity (book) for communications

By

Following the advice in Smart Brevity improves communication.

SQL Server 2025 Developer Edition – One size fits all

By

Microsoft has released SQL Server 2025, bringing big improvements to its main database engine....

Monday Monitor Tips: Learning While Using the Tool

By

A customer was asking about what certain items in Redgate Monitor mean. They have...

Read the latest Blogs

Forums

Using Python notebooks to save money in Fabric: The Fabric Modern Data Platform

By John Miner

Comments posted to this topic are about the item Using Python notebooks to save...

Your AI Successes

By Steve Jones - SSC Editor

Comments posted to this topic are about the item Your AI Successes

Switch partition: NULL boundary id

By vpolasa

I have table: t1 in schema1 with 19 billion records. I have another table...

Visit the forum

Question of the Day

Encoding NULL

What is returned from this code in SQL Server 2025?

SELECT BASE64_ENCODE(NULL)

See possible answers