Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.
Original topic: 突然不能链接数据库
Today, I suddenly couldn’t access the database. The log shows: grpc: Server.processUnaryRPC failed to write status connection error: desc = "transport is closing
What could be the reason?
First, check the cluster status:
tiup cluster display xxx
What error does it report when it can’t connect to the client?
Check the network status first.
The cluster means this device is offline.
The network SSH connection is not working, and it takes about 20 minutes to log in. After logging in, the cluster responds.
Take a look at the logs for node 34, and use display to check the status of node 34.
The gRPC issue is generally due to network disconnection or a service crash. First, check the cluster status with the display command.
Can’t connect to SSH? Did the server restart?
Check the network connection.
Check the cluster status and network.
SSH cannot connect. Either the network is down, or the memory is exhausted. Please check.
Before asking questions, make sure to follow the troubleshooting process yourself, check each step, and keep evidence. Otherwise, it will be difficult for others to help you pinpoint the issue.
If it was working fine and suddenly went offline, it’s most likely an OOM (Out of Memory) issue.
Authentication system issue? DNS, LDAP, IPA?
Check the network status and the maximum number of connections, and take a look at the local network settings.
Haven’t found the reason yet?