TiKV node is being decommissioned but remains in the decommissioning state; another port server is alive but monitoring indicates it is offline

Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: TiKV下线节点,但是一直下线中,另外一个端口服务器存活,但是监控提示不在线

| username: Johnnes_Xnn

【TiDB Usage Environment】Production Environment
【TiDB Version】
【Reproduction Path】View on the monitoring page
【Encountered Problem: TiKV node is offline, but it remains in the offline state. Another port server is alive, but the monitoring indicates it is offline.】
【Resource Configuration】Go to TiDB Dashboard - Cluster Info - Hosts and take a screenshot of this page
【Attachments: Screenshot/Logs/Monitoring】

| username: xfworld | Original post link

Check the service status of the node to see if it is alive…

The most direct way is to look at the logs.

| username: tidb狂热爱好者 | Original post link

First, back up the data, then add a new node to see if it works. The data from the 3 TiKV that went offline has nowhere to go.

| username: 有猫万事足 | Original post link

If you have only 3 TiKV instances with the default 3 replicas and you want to remove one, it will likely remain pending.

You need to scale out first, then scale in to meet the minimum requirement of 3 replicas.

| username: tidb狂热爱好者 | Original post link

There’s no place for the data to go.

| username: tidb菜鸟一只 | Original post link

Try to find another machine and expand TiKV by one more node first.

| username: wangccsy | Original post link

Is it possible for an offline node to still have logical processing?

| username: TiDBer_jYQINSnf | Original post link

Check pd-ctl, if there are 3 replicas, then there are 3 TiKV nodes, it can’t be moved.

| username: dba远航 | Original post link

Check the logs of the problematic node.