TiDB Still Sending Requests to a Scaled-In TiKV Node

Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: tidb 请求已经缩掉的kv (TiDB requesting a TiKV node that has already been scaled in)

| username: Holland

[TiDB Usage Environment] Production Environment / Testing / PoC
[TiDB Version]
TiDB 4.0.14. I scaled in a TiKV node, and it is no longer visible in pd-ctl. Why is TiDB still sending requests to this offlined TiKV node?


[Reproduction Path] What operations were performed to cause the issue
[Encountered Issue: Symptoms and Impact]
[Resource Configuration]
[Attachments: Screenshots/Logs/Monitoring]

| username: weixiaobing | Original post link

When was the scale-in completed? Use tiup cluster display to check.
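For reference, such a check might look like this (the cluster name is a placeholder):

```shell
# Show the cluster topology and each node's status;
# <cluster-name> is a placeholder for your actual cluster name
tiup cluster display <cluster-name>
```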

| username: Holland | Original post link

I did not use tiup.

| username: Holland | Original post link

It looks like 13:56.

| username: weixiaobing | Original post link

So how did you scale it in?

| username: Holland | Original post link

First, evict the store's leaders, then execute pd-ctl store delete 5016. After waiting for the store to become a tombstone, execute store remove-tombstone.

| username: weixiaobing | Original post link

With this approach, the cluster information may not get updated, so TiDB will keep sending requests to the old node, which can go on for a long time. It is recommended to use TiUP for scaling in and out: Scale a TiDB Cluster Using TiUP | PingCAP Archived Documentation
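For reference, a TiUP-based scale-in would look roughly like this (the cluster name and node address are placeholders):

```shell
# Scale in the TiKV node; TiUP waits for data migration to finish
# and updates the cluster topology metadata
tiup cluster scale-in <cluster-name> --node <tikv-host>:20160

# Verify afterwards that the node is gone from the topology
tiup cluster display <cluster-name>
```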

| username: h5n1 | Original post link

What specific command was executed, and what was the error reported? It seems like the region information in the PD cache hasn’t been updated.

| username: Holland | Original post link

pd-ctl

  1. scheduler add evict-leader-scheduler-5016
  2. Wait for the leader count to drop to 0
  3. store delete 5016
  4. Wait for the store to become a tombstone
  5. store remove-tombstone
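
As a sketch, that session would look roughly like this (the PD address is a placeholder, store ID 5016 is taken from this thread, and flag details may vary by pd-ctl version):

```shell
# 1. Evict all leaders from store 5016
pd-ctl -u http://<pd-host>:2379 scheduler add evict-leader-scheduler 5016
# 2. Check the store until leader_count reaches 0
pd-ctl -u http://<pd-host>:2379 store 5016
# 3. Begin offlining the store
pd-ctl -u http://<pd-host>:2379 store delete 5016
# 4. Check the store until state_name becomes Tombstone
pd-ctl -u http://<pd-host>:2379 store 5016
# 5. Clean up tombstone stores
pd-ctl -u http://<pd-host>:2379 store remove-tombstone
```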

| username: Holland | Original post link

I have restarted both PD and TiDB. It didn't help.

| username: Billmay表妹 | Original post link

Check out this article by Binbin.

| username: Holland | Original post link

I replaced all the TiDB nodes through scale-out and scale-in, and after that things recovered. Also, when the TiDB layer hits a region miss during a query, it re-fetches the region information from PD, right? Could it be that the fetch failed? And if it succeeded, why does the next request still go to the TiKV node that has already been removed?
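One way to check PD's side of this is to ask PD directly whether it still associates any Region with the removed store (store ID 5016 and the PD address are assumptions from this thread):

```shell
# List all Regions that PD still maps to store 5016;
# after a clean removal this should return an empty list
pd-ctl -u http://<pd-host>:2379 region store 5016
```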

| username: Holland | Original post link

But in my case, I waited until it became a tombstone rather than using --force to take it offline forcibly.

| username: Holland | Original post link

The monitoring also shows that the store's region count had dropped to zero before it was removed.
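Store status can also be cross-checked from within TiDB itself, for example (connection parameters are placeholders):

```shell
# Query TiDB's view of the store's status; REGION_COUNT and LEADER_COUNT
# should both be 0 before the tombstone is removed
mysql -h <tidb-host> -P 4000 -u root -e "
  SELECT STORE_ID, STORE_STATE_NAME, LEADER_COUNT, REGION_COUNT
  FROM INFORMATION_SCHEMA.TIKV_STORE_STATUS
  WHERE STORE_ID = 5016;"
```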

| username: Holland | Original post link

Could it be an unfixed bug related to the region cache?

| username: 会飞的土拨鼠 | Original post link

Were any corresponding changes made in the configuration files? Check whether the scaled-in TiKV node still needs to be commented out of the configuration file.

| username: Billmay表妹 | Original post link

Scaling Issues - Column Articles

Scaling Issues - Technical Q&A

Scaling Issues - Documentation/SOP

You can refer to some related content on scaling issues~