[source-mssql] The issue of slower speed when there is no changed data during CDC sync. #51475

Open
moonjueun opened this issue Jan 13, 2025 · 1 comment
Labels
cdc community connectors/source/mssql team/db-dw-sources Backlog for Database and Data Warehouse Sources team team/use

Comments

@moonjueun

Topic

Speed performance

Relevant information

I am syncing data from MSSQL to Snowflake using Airbyte CDC. When there is no changed data in MSSQL (no records are loaded), the sync takes more than 5 minutes. When there is changed data, the sync completes within 1 minute. In the no-change case, the following log lines repeat, which causes the delay. Is there a way to resolve this without increasing the scheduling interval?

2025-01-13 07:10:48 source > INFO pool-3-thread-1 i.a.c.i.d.AirbyteDebeziumHandler$CapacityReportingBlockingQueue(reportQueueUtilization):48 CDC events queue stats: size=0, cap=10000, puts=1, polls=0
2025-01-13 07:10:48 source > INFO main i.a.c.i.d.i.DebeziumRecordIterator(computeNext):87 CDC events queue poll(): blocked for PT12.027558767S in its first call.
2025-01-13 07:10:48 source > INFO main i.a.c.i.d.i.DebeziumRecordIterator(computeNext):140 CDC events queue poll(): returned a heartbeat event: progressing to 000117c9:00010b18:0065.
2025-01-13 07:10:59 source > INFO pool-3-thread-1 i.a.c.i.d.AirbyteDebeziumHandler$CapacityReportingBlockingQueue(reportQueueUtilization):48 CDC events queue stats: size=0, cap=10000, puts=2, polls=0
2025-01-13 07:10:59 source > INFO main i.a.c.i.d.i.DebeziumRecordIterator(computeNext):87 CDC events queue poll(): blocked for PT10.523720143S after its previous call which was also logged.
2025-01-13 07:10:59 source > INFO main i.a.c.i.d.i.DebeziumRecordIterator(computeNext):140 CDC events queue poll(): returned a heartbeat event: no progress since last heartbeat.
2025-01-13 07:11:09 source > INFO pool-3-thread-1 i.a.c.i.d.AirbyteDebeziumHandler$CapacityReportingBlockingQueue(reportQueueUtilization):48 CDC events queue stats: size=0, cap=10000, puts=3, polls=0
2025-01-13 07:11:09 source > INFO main i.a.c.i.d.i.DebeziumRecordIterator(computeNext):87 CDC events queue poll(): blocked for PT10.52190653S after its previous call which was also logged.
2025-01-13 07:11:09 source > INFO main i.a.c.i.d.i.DebeziumRecordIterator(computeNext):140 CDC events queue poll(): returned a heartbeat event: no progress since last heartbeat.
2025-01-13 07:11:20 source > INFO pool-3-thread-1 i.d.c.c.BaseSourceTask(logStatistics):323 2 records sent during previous 00:00:21.047, last recorded offset of 
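
To quantify how much of a sync is spent waiting on an empty queue, the "blocked for PT...S" durations in a log like the one above can be added up. The following is a minimal sketch, not part of Airbyte: the function names `total_blocked_seconds` and `count_idle_heartbeats` are hypothetical, and the regular expression assumes the durations always appear as seconds (optionally preceded by minutes), as they do in the lines shown here.

```python
import re

# Assumption: durations look like "PT12.027558767S" or "PT1M2.5S",
# matching the format seen in the DebeziumRecordIterator log lines above.
BLOCKED_RE = re.compile(r"blocked for PT(?:(\d+)M)?(\d+(?:\.\d+)?)S")


def total_blocked_seconds(log_lines):
    """Sum the time the CDC events queue poll() reported being blocked."""
    total = 0.0
    for line in log_lines:
        match = BLOCKED_RE.search(line)
        if match:
            minutes = int(match.group(1) or 0)
            total += minutes * 60 + float(match.group(2))
    return total


def count_idle_heartbeats(log_lines):
    """Count heartbeat events that reported no progress."""
    return sum("no progress since last heartbeat" in line for line in log_lines)


# Shortened copies of the log lines from this issue.
sample = [
    "CDC events queue poll(): blocked for PT12.027558767S in its first call.",
    "CDC events queue poll(): returned a heartbeat event: progressing to 000117c9:00010b18:0065.",
    "CDC events queue poll(): blocked for PT10.523720143S after its previous call which was also logged.",
    "CDC events queue poll(): returned a heartbeat event: no progress since last heartbeat.",
    "CDC events queue poll(): blocked for PT10.52190653S after its previous call which was also logged.",
    "CDC events queue poll(): returned a heartbeat event: no progress since last heartbeat.",
]

if __name__ == "__main__":
    print(f"blocked seconds: {total_blocked_seconds(sample):.2f}")
    print(f"idle heartbeats: {count_idle_heartbeats(sample)}")
```

Running this over a full sync log would show whether the 5+ minutes are dominated by these idle waits, which helps when comparing runs with different source timeout settings.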
@marcosmarxm added the cdc, connectors/source/mssql, and team/db-dw-sources (Backlog for Database and Data Warehouse Sources team) labels and removed the needs-triage and autoteam labels on Jan 13, 2025
@marcosmarxm changed the title from "The issue of slower speed when there is no changed data during CDC sync." to "[source-mssql] The issue of slower speed when there is no changed data during CDC sync." on Jan 13, 2025
@theashishbhatt

What is the timeout value in the source settings?

4 participants