How to Synchronize Data from TiDB to HDFS in Real-Time?

This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: 如何将 TiDB 中的数据实时的同步写入到 HDFS 上?

| username: dcswinner

| username: xfworld

Install the TiCDC component.

TiCDC can then capture changes from the tables you choose in real time and send them to a downstream Kafka cluster.

There are many ways to move data from Kafka to HDFS; look into them and choose the one that fits your setup.

| username: liuis


| username: dockerfile

TiCDC -> Kafka -> a consumer that writes to any downstream system
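
For the last hop in the pipeline described in this thread (consume from Kafka, write downstream), here is a minimal sketch of the transformation step only. It assumes the changefeed emits Canal-JSON-style messages with the fields `database`, `table`, `type`, `isDdl`, `es`, and `data`; check the actual message layout for your TiCDC version and protocol setting. The function names `canal_json_to_rows` and `to_ndjson` are made up for illustration, and the Kafka consumer and HDFS writer are deliberately left out.

```python
import json
from typing import Iterable, Iterator


def canal_json_to_rows(raw: bytes) -> Iterator[dict]:
    """Flatten one change event into one dict per changed row.

    Assumption: `raw` is a Canal-JSON-style message whose "data" field
    is a list of row objects. Field names here are assumed, not verified
    against any particular TiCDC release.
    """
    event = json.loads(raw)
    # Skip DDL events and events that carry no row data.
    if event.get("isDdl") or not event.get("data"):
        return
    for row in event["data"]:
        yield {
            "db": event.get("database"),
            "table": event.get("table"),
            "op": event.get("type"),        # e.g. INSERT / UPDATE / DELETE
            "commit_ms": event.get("es"),   # event timestamp, if present
            "row": row,
        }


def to_ndjson(messages: Iterable[bytes]) -> str:
    """Turn a batch of raw messages into newline-delimited JSON text."""
    lines = [
        json.dumps(record, sort_keys=True)
        for message in messages
        for record in canal_json_to_rows(message)
    ]
    return "".join(line + "\n" for line in lines)


if __name__ == "__main__":
    # Hand-written sample message, not real TiCDC output.
    sample = json.dumps({
        "database": "test",
        "table": "orders",
        "type": "INSERT",
        "isDdl": False,
        "es": 1700000000000,
        "data": [{"id": "1", "amount": "9.99"}],
    }).encode("utf-8")
    print(to_ndjson([sample]), end="")
```

In a real job, the messages would come from a Kafka consumer and each batch of text would be appended to a file on HDFS through whichever client you already use. Writing in batches rather than per message matters because HDFS handles many tiny files poorly.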