Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.Original topic: 寻求TiDB处理半结构化数据的最佳实践

Conducting a technical survey for a project that the company is about to launch. It is expected to have half structured data and half semi-structured data. The system will be deployed in an internal network data center.
If it were deployed on a public cloud, it would probably be MySQL+MongoDB, as there is almost no need for maintenance.
If it is in an internal network data center, a distributed database is necessary. It is also estimated that it will be split into several subsystems, each with its own independent data cluster.
Using a TiDB cluster + MongoDB cluster feels too complex. I hope to handle everything with one setup.
There are probably not many transactional operations, and storing structured data in MongoDB is not a big issue. But it feels like the MongoDB community is almost non-existent.
If using TiDB, I am very concerned about its support for semi-structured data. I checked the documentation, and TiDB has some simple support for JSON, which seems to be still in the experimental stage: JSON 类型 | PingCAP 文档中心
So I am seeking some practical experience with TiDB handling semi-structured data and technical selection advice.