How to Synchronize Data from TiDB to HDFS in Real-Time?

Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: 如何将 TiDB 中的数据实时的同步写入到 HDFS 上?

| username: dcswinner

[TiDB Usage Environment] Production Environment / Testing / POC
[TiDB Version] v6.5
[Reproduction Path] Operations performed that led to the issue
[Encountered Issues: Issue Symptoms and Impact]
[Resource Configuration]
[Attachments: Screenshots / Logs / Monitoring]

| username: xfworld | Original post link

Install the component ticdc

Then CDC can choose which data to capture in real-time and send to the downstream component Kafka

There are many ways to transfer data from Kafka to HDFS, please refer to and choose accordingly

| username: liuis | Original post link

TiCDC

| username: dockerfile | Original post link

ticdc—kafka–consume and write to any downstream