Note:
This topic has been translated from a Chinese forum by GPT and might contain errors. Original topic: 如何基于TiDB和开源组件打造一个轻量级中台 (How to build a lightweight middle platform based on TiDB and open-source components)
Requirements:
- Data Collection: Mainly from Oracle, MySQL, the group’s data sharing and exchange platform, and a large number of standalone systems (possibly using SQLite or Access). Collection generally runs once a day.
- Computation: Micro-batch processing is sufficient. OLTP data volume is very small; there are some OLTP requirements, but they rarely change. The volume of log analysis is also small.
- Storage: Huawei Cloud Storage; the annual data volume should not exceed 10 million records.
- Governance: There are governance requirements.
- Shared Exchange: Need to provide shared exchange functionality.
- Visualization: There is an IOC platform that needs to fetch indicator results from the data middle platform.
Plan:
Use TiDB for data import, storage, data warehousing, and serving; Flink for stream and batch processing; Kafka as the message queue; Redis for caching; Elasticsearch (ES) for log collection; and DataHub for data governance (the project open-sourced by LinkedIn, not the Alibaba product of the same name). ZooKeeper, YARN, and HDFS are not a focus.
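To make the daily collection step more concrete, here is a minimal sketch of a micro-batch load from one standalone SQLite system into a MySQL-compatible target such as TiDB. It is only an illustration under assumptions: the function names, the table and column names, and the batch size are hypothetical, and it assumes the target table has a primary key so that `REPLACE INTO` makes a rerun of the same day idempotent.

```python
import sqlite3
from datetime import date
from typing import Iterable, Iterator, List, Sequence, Tuple


def read_daily_rows(
    src: sqlite3.Connection,
    table: str,
    columns: Sequence[str],
    date_column: str,
    day: date,
    batch_size: int = 500,
) -> Iterator[List[Tuple]]:
    """Yield the rows whose date_column falls on `day`, in batches.

    `table`, `columns` and `date_column` are trusted identifiers from
    your own configuration (hypothetical names), never user input.
    """
    cols = ", ".join(columns)
    cur = src.execute(
        f"SELECT {cols} FROM {table} WHERE date({date_column}) = ?",
        (day.isoformat(),),
    )
    while True:
        batch = cur.fetchmany(batch_size)
        if not batch:
            break
        yield batch


def load_batches(
    target_conn,
    table: str,
    columns: Sequence[str],
    batches: Iterable[List[Tuple]],
    placeholder: str = "%s",
) -> int:
    """Write batches through any DB-API connection and return the row count.

    Assumption: the target table has a primary key, so REPLACE INTO
    overwrites rows instead of duplicating them when a day is reloaded.
    `placeholder` is "%s" for MySQL-protocol drivers and "?" for sqlite3.
    """
    cols = ", ".join(columns)
    marks = ", ".join([placeholder] * len(columns))
    sql = f"REPLACE INTO {table} ({cols}) VALUES ({marks})"
    total = 0
    cur = target_conn.cursor()
    for batch in batches:
        cur.executemany(sql, batch)
        target_conn.commit()  # one commit per micro-batch
        total += len(batch)
    return total
```

With a MySQL-protocol driver such as PyMySQL, `target_conn` would be a connection to the TiDB server (TiDB listens on port 4000 by default) and `placeholder` stays `"%s"`. Oracle or Access sources would need their own reader, but could reuse `load_batches` unchanged.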
Open questions:
Can TiDB cover all of these functions? I would also like a unified web management interface.
I only have theoretical knowledge and some CRUD experience with MySQL, so I would appreciate guidance from everyone. Please be gentle!