TiKV Monitoring Lost

This topic has been translated from a Chinese forum by GPT and might contain errors.

| username: julyxiong |

【TiDB Usage Environment】Production
【TiDB Version】v3.0.3
【Encountered Problem】TiKV monitoring lost
【Reproduction Path】
【Problem Phenomenon and Impact】
Prometheus has lost the monitoring data for one TiKV instance, and another TiKV instance shows a gap in its monitoring data.

| username: julyxiong |

Can this issue be resolved by directly restarting TiKV?

| username: xfworld |

The issues need to be looked at separately:

  1. Is the TiKV node service functioning normally?
  2. Are the blackbox_exporter and Node Exporter services on the TiKV node functioning normally?
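To tell these two cases apart, one option is to ask Prometheus which scrape targets it currently sees as down. Below is a minimal sketch that runs the built-in `up` query through the Prometheus HTTP API and lists the targets whose value is 0. The function names are made up for illustration, and the Prometheus address is whatever your deployment uses (port 9090 is only the usual default).

```python
import json
import urllib.parse
import urllib.request


def down_targets(payload):
    """Return sorted (job, instance) pairs whose `up` sample is 0.

    `payload` is the decoded JSON body of an instant query for `up`.
    """
    if payload.get("status") != "success":
        raise ValueError("Prometheus query failed: %r" % payload.get("error"))
    down = []
    for sample in payload["data"]["result"]:
        labels = sample["metric"]
        _timestamp, value = sample["value"]  # value is a string such as "1" or "0"
        if float(value) == 0:
            down.append((labels.get("job", "?"), labels.get("instance", "?")))
    return sorted(down)


def query_up(prometheus_url):
    """Run the instant query `up` against /api/v1/query and decode the JSON."""
    query = urllib.parse.urlencode({"query": "up"})
    url = prometheus_url.rstrip("/") + "/api/v1/query?" + query
    with urllib.request.urlopen(url, timeout=5) as resp:
        return json.load(resp)
```

For example, `down_targets(query_up("http://127.0.0.1:9090"))` would list the failing targets on a Prometheus listening at that (assumed) address. If only the exporter jobs show up, the TiKV process itself is probably fine. A target that is missing from the Prometheus configuration entirely will not appear in the result at all.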

If the TiKV service itself is functioning normally, restarting it won’t bring the exporters back up…

Please refer to this~

| username: Kongdom |

Could it be caused by the repeated switching of the PD leader?

| username: wuxiangdong |

First, check whether the ports for TiKV and the exporters are still reachable.
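A quick way to do this from the monitoring host is a plain TCP connect test. This is only a sketch: the port numbers below are the usual defaults (20160 for TiKV, 20180 for the TiKV status/metrics port, 9100 for Node Exporter, 9115 for blackbox_exporter) and are assumptions here, so check them against your own inventory file. The helper names are hypothetical.

```python
import socket

# Assumed default ports; adjust to match your deployment.
DEFAULT_PORTS = {
    "tikv": 20160,
    "tikv_status": 20180,
    "node_exporter": 9100,
    "blackbox_exporter": 9115,
}


def port_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and unreachable hosts.
        return False


def check_host(host, ports=None):
    """Map each service name to whether its port accepts connections."""
    ports = DEFAULT_PORTS if ports is None else ports
    return {name: port_open(host, port) for name, port in ports.items()}
```

Calling `check_host("<tikv-node-ip>")` returns a dict of service name to `True`/`False`. An open port only shows that something is listening; it does not prove the metrics endpoint is actually answering.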

| username: julyxiong |

I couldn’t wait for community help, so I went ahead and restarted the TiKV instance :sweat_smile:

After the restart, the metrics interface of TiKV is back to normal.
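For anyone who wants to verify the same thing, here is a rough sketch that pulls the metrics page and lists the metric names it exposes. It assumes the TiKV metrics interface is served at `http://<host>:20180/metrics` (the usual status port, not confirmed for this cluster), and the function names are invented for illustration.

```python
import urllib.request


def fetch_metrics(url, timeout=5.0):
    """Download a Prometheus text-format metrics page as a string."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")


def metric_names(text):
    """Return the set of metric names found in text-format metrics output."""
    names = set()
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        # A sample looks like `name{labels} value` or `name value`.
        name = line.split("{", 1)[0].split(None, 1)[0]
        names.add(name)
    return names
```

If `metric_names(fetch_metrics("http://<host>:20180/metrics"))` comes back non-empty and contains `tikv_` entries, the endpoint is serving data again and Prometheus should be able to scrape it on its next interval.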

Now I’m waiting for the TiKV Scheduler error to clear…