Eliminating PD_down_peer_region_nums Alert

Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: PD_down_peer_region_nums告警消除

| username: TiDBer_pkQ5q1l0

[TiDB Usage Environment] Production Environment
[TiDB Version] V6.1.0
[Encountered Issue] PD_down_peer_region_nums Alert
[Issue Phenomenon and Impact]
The alertmanager shows a PD_down_peer_region_nums alert. How can I fix it?

| username: Billmay表妹 | Original post link

Could it be caused by this issue~

| username: Billmay表妹 | Original post link

PD_down_peer_region_nums

  • Alert Rule: (sum(pd_regions_status{type="down-peer-region-count"}) by (instance) > 0) and (sum(etcd_server_is_leader) by (instance) > 0)
  • Rule Description: The number of Regions reported by the Raft leader that have unresponsive peers.
  • Handling Method:
    • Check if there is any TiKV downtime, recent restart, or if it is busy.
    • Observe the region health panel and check if the down_peer_region_count is continuously decreasing.
    • Check for network connectivity issues between TiKVs.
| username: songxuecheng | Original post link

pd-ctl region check down-peer

| username: TiDBer_pkQ5q1l0 | Original post link

How should I handle the check result? down.pd (33.0 KB)

| username: songxuecheng | Original post link

Have there been any previous operations? Check if there are any issues with these three stores.

| username: TiDBer_pkQ5q1l0 | Original post link

Expanded by 3 TiKV nodes.

| username: songxuecheng | Original post link

If it doesn’t keep increasing, you can wait for the cluster expansion balance to complete and then observe.

| username: songxuecheng | Original post link

This node with “store_id”: 5 is a bit strange, are there any other operations?

| username: TiDBer_pkQ5q1l0 | Original post link

The expansion was done yesterday, and today I see that the balance has been completed. There were no other operations besides the expansion.

| username: songxuecheng | Original post link

You can continue to observe; it seems to still be doing region balance.

| username: TiDBer_pkQ5q1l0 | Original post link

Okay, thank you.

| username: system | Original post link

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.