TiDB PD service is always in a down state

[FATAL] [main.go:232] ["run server failed"] [error="[PD:server:ErrCancelStartEtcd]etcd start canceled"] [stack="main.start

[ERROR] [etcdutil.go:83] ["failed to get cluster from remote"] [error="[PD:etcd:ErrEtcdGetCluster]failed to get raft cluster member(s) from the given URLs: failed to get raft cluster member(s) from the given URLs"]
[2024/04/11 17:39:13.633 +08:00] [WARN] [server.go:2098] ["failed to publish local member to cluster through raft"] [local-member-id=b43ecfd4b44129fc] [local-member-attributes="{Name:pd-1 ClientURLs:[]}"] [request-path=/0/members/b43ecfd4b44129fc/attributes] [publish-timeout=11s] [error="etcdserver: request timed out"]

Is it a new cluster? It looks like the startup parameters for PD are configured incorrectly, particularly the URL part.

Isn’t the PD client port usually 2379? Could the port be configured incorrectly?

There is no mistake; we use a custom port.

It’s not a new cluster; it has been running for a while. One of the PD nodes has been in a down state because the file system is full.

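This node can’t connect to the other PD members, so check your network first. If the network is fine, list the cluster members from a healthy PD (for example, with the member command in pd-ctl) and see whether the failed PD is still in the list. If it is, use: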

member delete

to delete it, then clear that PD’s data directory and rebuild the node. PD stores very little data, so rebuilding won’t take long.
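
The check described above could be sketched roughly as follows. This is only an illustration: it assumes the member list has been saved as JSON with a top-level "members" array whose entries carry "name" and "member_id" fields, and the sample data, addresses and helper names (find_member, delete_command) are hypothetical. Only the node name pd-1 comes from the log above. Verify the exact member delete syntax against your pd-ctl version before running it.

```python
import json

# Hypothetical sample of a saved member list; the field names are an
# assumption about the JSON shape, and the IDs and URLs are made up.
SAMPLE = """
{
  "members": [
    {"name": "pd-1", "member_id": 1001, "client_urls": ["http://10.0.0.1:12379"]},
    {"name": "pd-2", "member_id": 1002, "client_urls": ["http://10.0.0.2:12379"]}
  ]
}
"""


def find_member(members_json: str, name: str):
    """Return the member entry with the given name, or None if it is absent."""
    data = json.loads(members_json)
    for member in data.get("members", []):
        if member.get("name") == name:
            return member
    return None


def delete_command(name: str) -> str:
    """Build the pd-ctl subcommand that removes a member by name."""
    return f"member delete name {name}"


if __name__ == "__main__":
    member = find_member(SAMPLE, "pd-1")
    if member is None:
        print("pd-1 is not in the member list; nothing to delete")
    else:
        print("stale member found, run in pd-ctl:", delete_command("pd-1"))
```

The script only prints the command rather than executing it, so the removal stays a deliberate manual step.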

It might be that the PD data directory was not cleared. I’ll try again tomorrow. Thank you.

A full disk will certainly cause the server to misbehave. Try clearing unused files to free up space.
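
Before restarting the node, it may help to confirm that space was actually freed. Here is a minimal sketch using Python's standard library. The path and the 10% threshold are placeholders rather than values from this thread, so point it at your own PD data directory.

```python
import shutil


def free_fraction(path: str) -> float:
    """Return the fraction of the file system holding `path` that is free."""
    usage = shutil.disk_usage(path)
    return usage.free / usage.total


def is_low_on_space(path: str, min_free: float = 0.10) -> bool:
    """True if less than `min_free` (a fraction, 0.10 = 10%) is free."""
    return free_fraction(path) < min_free


if __name__ == "__main__":
    data_dir = "."  # placeholder; replace with the PD data directory
    print(f"{free_fraction(data_dir):.1%} free, low on space: {is_low_on_space(data_dir)}")
```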