Severe Data Skew Issue When Syncing Data from TiCDC to Kafka

Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: ticdc往kafka同步数据出现数据倾斜严重问题

| username: xxxxxxxx

Background: ticdc synchronizes incremental data to Kafka with partition-num=4 set.
Image

Phenomenon: Currently, only one partition in the downstream Kafka has data, while the other partitions are empty.
Enterprise WeChat Screenshot_17007262704538

Version: 6.1.7

What could be the reason for this? Is it because “dispatchers” is not set? I did not configure this parameter separately, and in the CDC task details, this dispatchers is null. According to online documentation, if you need to balance the data across partitions, you need to configure this parameter.

CDC sink configuration
Image

| username: db_user | Original post link

Take a look at the specific partitions of this topic on Kafka.