[TiDB Usage Environment] Production Environment
[TiDB Version] v6.1.0
[Reproduction Path] Operations performed that led to the issue
[Encountered Issue: Issue Phenomenon and Impact]
Alert Details: TiCDC heap memory usage is over 10 GB
[Resource Configuration]
Configured 256G of host memory for cdc, actual usage is 10G, but it still uses host disk space. How to adjust to use more memory, and the cdc task has delays. No large transactions found on the source side, how to analyze if it is slow to push to Kafka.
total used free shared buff/cache available
Mem: 251 9 232 0 9 241
Swap: 31 0 31
[Attachments: Screenshots/Logs/Monitoring]
Upstream
a. Large transaction commits causing delayed processing
b. Resource bottlenecks leading to slow event processing
c. Network congestion
CDC itself
Refer to the monitoring metrics which are easily accessible, no need to describe further.
Downstream
a. Reception delay, downstream congestion causing backlog and untimely processing
b. Network congestion
c. Insufficient resources leading to slow processing
You will need to investigate each item specifically, which can be quite troublesome. This is for your reference.
In the case of deploying TiCDC separately, is the usage of TiCDC heap memory on the TiKV host or the CDC host? The principle is still somewhat unclear.
There are too many files being collected, the files are too large, and it’s too slow, causing the collection to be interrupted each time. Is there any other diagnostic method?