Why is the size of TiKV different?

translator_bot · June 21, 2024, 7:32pm

Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: tikv为什么大小不一样

| username: 大钢镚13146

[TiDB Usage Environment] Production Environment / Testing / PoC
[TiDB Version]
[Reproduction Path] What operations were performed that led to the issue
[Encountered Issue: Problem Phenomenon and Impact]
[Resource Configuration] Go to TiDB Dashboard - Cluster Info - Hosts and take a screenshot of this page
[Attachments: Screenshots/Logs/Monitoring]
The space occupied by different TiKV nodes is inconsistent? What causes this issue and how can it be resolved?

translator_bot · June 21, 2024, 7:32pm

| username: tidb菜鸟一只 | Original post link

Is the balance process not progressing? Are the store weights set differently? Check if there are issues with the labels.

translator_bot · June 21, 2024, 7:32pm

| username: 像风一样的男子 | Original post link

First, confirm whether the disks of each KV are the same size, and then check the region distribution in the monitoring.

translator_bot · June 21, 2024, 7:32pm

| username: 大钢镚13146 | Original post link

500G, the maximum and minimum difference in TiKV is about 100G of space. This doesn’t seem to be a normal range, right?

translator_bot · June 21, 2024, 7:32pm

| username: ajin0514 | Original post link

Take a look at the system tables.

translator_bot · June 21, 2024, 7:32pm

| username: 普罗米修斯 | Original post link

Please share this table for us to take a look.

translator_bot · June 21, 2024, 7:32pm

| username: realcp1018 | Original post link

Hahaha, do you want to take a look at this tool: tidb-toolkit/scripts/tk_pdctl.py at main · realcp1018/tidb-toolkit · GitHub
Just fill in the IP and Port.

translator_bot · June 21, 2024, 7:32pm

| username: Soysauce520 | Original post link

PD scheduling monitoring

translator_bot · June 21, 2024, 7:32pm

| username: 大飞哥online | Original post link

Region distribution

translator_bot · June 21, 2024, 7:32pm

| username: ajin0514 | Original post link

Different region distribution

translator_bot · June 21, 2024, 7:32pm

| username: 大飞哥online | Original post link

Sometimes the number might be different, but the size will be similar.

translator_bot · June 21, 2024, 7:32pm

| username: zhanggame1 | Original post link

If the TiKV disks are of different sizes, there might be some differences. Additionally, if the data volume is too small, that could also be a factor.

translator_bot · June 21, 2024, 7:32pm

| username: Jellybean | Original post link

You need to check the working status of the balance scheduler, whether it is scheduling normally, and whether the scheduling strategy is reasonable. For example, whether the label settings of TiKV and PD are correct, whether the disk sizes are consistent, whether the available space is the same, whether the pressure is balanced, and the scoring of each instance node.

Once you have checked the above information, your problem should basically be resolved.

translator_bot · June 21, 2024, 7:32pm

| username: YuchongXU | Original post link

Are the host configurations the same?

translator_bot · June 21, 2024, 7:32pm

| username: Kongdom | Original post link

If the difference is not significant, it is a normal phenomenon, as data cannot be evenly distributed in the true sense.

translator_bot · June 21, 2024, 7:32pm

| username: 大飞哥online | Original post link

Relatively, but not absolutely.

translator_bot · June 21, 2024, 7:32pm

| username: ajin0514 | Original post link

Take a look at the settings.

translator_bot · June 21, 2024, 7:32pm

| username: TiDBer_小阿飞 | Original post link

After pushing PD, the message queue creates regions on each KV inconsistently, right?

translator_bot · June 21, 2024, 7:32pm

| username: system | Original post link

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.