Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.
Original topic: TiFlash 数据量分布不均匀 (Uneven data distribution across TiFlash nodes)
【TiDB Usage Environment】Production Environment
【TiDB Version】7.5
【Encountered Problem: Problem Phenomenon and Impact】
Data is unevenly distributed across the TiFlash nodes. Two of the three nodes hold tens of gigabytes each, while the third holds 200 GB. How can I balance the data among the TiFlash nodes? Can any experts help me figure out how to solve this?
Is the storage surge a GC issue? All other nodes are normal.
Is this a mixed deployment? First, log in to the host and confirm that all of the space is really used by TiFlash data. Some of it might be log files, which would explain the imbalance.
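To make the check above concrete, here is a minimal Python sketch that sums disk usage per top-level subdirectory so that data and log directories can be compared. The function names are made up for this illustration, and the directory you pass in depends on your own deployment; nothing here is TiFlash-specific.

```python
import os

def dir_size_bytes(path):
    """Sum the sizes of all regular files under path (symlinks are skipped)."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            full = os.path.join(root, name)
            try:
                if not os.path.islink(full):
                    total += os.path.getsize(full)
            except OSError:
                # The file disappeared between listing and stat; skip it.
                pass
    return total

def usage_by_subdir(base):
    """Return (name, bytes) for each direct child directory of base, largest first."""
    sizes = []
    for entry in os.scandir(base):
        if entry.is_dir(follow_symlinks=False):
            sizes.append((entry.name, dir_size_bytes(entry.path)))
    return sorted(sizes, key=lambda item: item[1], reverse=True)
```

Calling `usage_by_subdir()` on the TiFlash deploy directory of the large node and of a normal node, then comparing the two lists, should show whether the extra space sits under the data directory or under logs. Running it again on the data directory itself narrows it down to a single subdirectory.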
The files for one table suddenly ballooned to 174 GB on disk, but the table size is reported as only 2 GB.
Are you saying that the GC didn’t delete the old data?
Have you checked the TiFlash logs for any related error messages?
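A rough Python sketch for that log check follows. It assumes each log line carries a bracketed level field such as `[ERROR]` or `[WARN]`; if your log files use a different layout, adjust the pattern. The helper names are hypothetical.

```python
import re
from collections import Counter

# Assumed layout: the level appears as a bracketed field, e.g. "[ERROR]".
LEVEL_RE = re.compile(r"\[(DEBUG|INFO|WARN|ERROR|FATAL)\]")

def count_levels(lines):
    """Count how many lines were logged at each level."""
    counts = Counter()
    for line in lines:
        match = LEVEL_RE.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts

def problem_lines(lines, levels=("WARN", "ERROR", "FATAL")):
    """Return the lines whose level is in levels, in original order."""
    selected = []
    for line in lines:
        match = LEVEL_RE.search(line)
        if match and match.group(1) in levels:
            selected.append(line.rstrip("\n"))
    return selected
```

Opening the log file with `open(path, errors="replace")` and passing the file object to `problem_lines()` gives a short list to read through, for example to look for messages around the time the storage started growing.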
Indeed, it is historical data that has not been deleted. On the normal TiFlash nodes there is just one file.
Restarting solved it.
Restarting fixed it, but does anyone know why?
The three basic IT troubleshooting steps: restart, reinstall, replace the machine.
Another possibility is that the data in the monitoring system is outdated. After restarting and fetching new data, it should be fine.