TiDB CDC Full Data Refresh

Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: tidb cdc 全量数据刷新

| username: TiDBer_7PAp3Gl0

[TiDB Usage Environment] Production Environment
[TiDB Version] v7.5.0
We are using cdc-kafka to refresh incremental data into Kafka. However, to be consistent with the previous CDC methods of MySQL and SQL Server, we hope to also store full data in Kafka, and do so before the incremental data, while maintaining the continuity of both full and incremental data. What methods are currently available to achieve this?

| username: WalterWj | Original post link

How about this one?

| username: 像风一样的男子 | Original post link

CDC should only support incremental data synchronization, and full data can only be handled by other methods.

| username: Daniel-W | Original post link

CDC does not support full data, only incremental.

| username: TiDBer_QYr0vohO | Original post link

It feels like I can only write a script to achieve this.

| username: dba远航 | Original post link

This is somewhat difficult to implement.

| username: 呢莫不爱吃鱼 | Original post link

CDC can only handle incremental data, not full data. You can try using CloudCanal for synchronization and see if this workflow can be completed.

| username: 友利奈绪 | Original post link

In theory, it only supports incremental updates. You can look into it further.