[TiDB Usage Environment] Production Environment
[TiDB Version] 6.5.1
[Reproduction Path] Operations performed that led to the issue
[Encountered Issue: Issue Phenomenon and Impact]
[Resource Configuration] Enter TiDB Dashboard - Cluster Info - Hosts and take a screenshot of this page
[Attachment: Screenshot/Log/Monitoring]
On October 24th, a large number of connection error logs appeared in the tidb server logs, as shown below:
After troubleshooting the network, no obvious anomalies were found; the TiDB Server did not restart or have any other issues. TiDB uses Consul for load balancing, but during this time period, the ERROR logs mentioned above appeared in the logs of both TiDB servers.
The red color indicating LOCK resolve on the OPS panel seems a bit abnormal. Is there a noticeable increase in txnLock on the KV Backoff OPS panel during the same time period? Could it be due to a brief write conflict caused by unlocking?
The reason for this error is that one side is closing the connection while the other side is writing data.
If it’s not an issue with the connection pool strategy on the application side, then you should check the settings of proxies like HAProxy. Additionally, connection number limits on either side could also cause this problem.
You can refer to this: golang服务报错: write: broken pipe-CSDN博客.
TiDB itself will not proactively close this connection. Moreover, this error on the server side also indicates that the party closing the connection is most likely not TiDB.