Why is the size of TiKV different?

Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: tikv为什么大小不一样

| username: 大钢镚13146

[TiDB Usage Environment] Production Environment / Testing / PoC
[TiDB Version]
[Reproduction Path] What operations were performed that led to the issue
[Encountered Issue: Problem Phenomenon and Impact]
[Resource Configuration] Go to TiDB Dashboard - Cluster Info - Hosts and take a screenshot of this page
[Attachments: Screenshots/Logs/Monitoring]
The space occupied by different TiKV nodes is inconsistent? What causes this issue and how can it be resolved?

| username: tidb菜鸟一只 | Original post link

Is the balance process not progressing? Are the store weights set differently? Check if there are issues with the labels.

| username: 像风一样的男子 | Original post link

First, confirm whether the disks of each KV are the same size, and then check the region distribution in the monitoring.

| username: 大钢镚13146 | Original post link

500G, the maximum and minimum difference in TiKV is about 100G of space. This doesn’t seem to be a normal range, right?

| username: ajin0514 | Original post link

Take a look at the system tables.

| username: 普罗米修斯 | Original post link

Please share this table for us to take a look.

| username: realcp1018 | Original post link

Hahaha, do you want to take a look at this tool: tidb-toolkit/scripts/tk_pdctl.py at main · realcp1018/tidb-toolkit · GitHub
Just fill in the IP and Port.

| username: Soysauce520 | Original post link

PD scheduling monitoring

| username: 大飞哥online | Original post link

Region distribution

| username: ajin0514 | Original post link

Different region distribution

| username: 大飞哥online | Original post link

Sometimes the number might be different, but the size will be similar.

| username: zhanggame1 | Original post link

If the TiKV disks are of different sizes, there might be some differences. Additionally, if the data volume is too small, that could also be a factor.

| username: Jellybean | Original post link

You need to check the working status of the balance scheduler, whether it is scheduling normally, and whether the scheduling strategy is reasonable. For example, whether the label settings of TiKV and PD are correct, whether the disk sizes are consistent, whether the available space is the same, whether the pressure is balanced, and the scoring of each instance node.

Once you have checked the above information, your problem should basically be resolved.

| username: YuchongXU | Original post link

Are the host configurations the same?

| username: Kongdom | Original post link

If the difference is not significant, it is a normal phenomenon, as data cannot be evenly distributed in the true sense.

| username: 大飞哥online | Original post link

Relatively, but not absolutely.

| username: ajin0514 | Original post link

Take a look at the settings.

| username: TiDBer_小阿飞 | Original post link

After pushing PD, the message queue creates regions on each KV inconsistently, right?

| username: system | Original post link

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.