Solution for All Three PD Nodes Down

Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: pd三个节点全挂了解决方案

| username: wfxxh

[TiDB Usage Environment] Production environment
[TiDB Version] v5.4.2, physical machine with 40 cores, 128G, pure TiDB independent cluster
[Encountered Problem: Problem Phenomenon and Impact] All PDs are down, and restarting doesn’t work

pd_stderr61.log (1.3 KB)
pd_stderr62.log (812 bytes)
pd_stderr63.log (812 bytes)
pd61.log (81.0 KB)
pd62.log (71.0 KB)
pd63.log (77.9 KB)

| username: WalterWj | Original post link

The logs are very clear, the port is occupied. Did you do a mixed deployment and not change the port?

| username: wfxxh | Original post link

Take another look.

| username: ohammer | Original post link

It looks like node 1 has panicked. Can the cluster function normally if we start only nodes 2 and 3?

| username: wfxxh | Original post link

No. The PD cluster has already been reset, and the cluster has been reset using pd-recover.

| username: system | Original post link

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.