Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.Original topic: tiup离线升级问题
[TiDB Usage Environment] Production Environment
[TiDB Version] Upgraded from tidb v5.4.0 to v5.4.3
At the end of the upgrade, an error occurred when stopping node_exporter, with the following information:
This issue prevents the continuation of the process using tiup cluster replay audit_id. Even if the node_exporter on the relevant cluster servers has been manually stopped, the error persists.
[Encountered Issues]
- How to manually complete the remaining steps, which are to stop node_exporter, stop blackbox_exporter, start node_exporter, and start blackbox_exporter.
- Although the bug has been fixed at this stage and the MySQL client shows version 5.4.3 when logging into the cluster, tiup cluster display still shows the cluster or components as version 5.4.0.
- Reproducing the issue in a test environment shows that only re-upgrading can solve the problem. However, restarting causes business disconnection, so I want to know if there is a way to solve this issue without restarting. Thank you.
[Reproduction Path] Operations that led to the issue
Manually deployed other node_exporters on component servers, then executed the upgrade operation, encountered an error, manually stopped node_exporter, executed tiup cluster replay audit_id, and still encountered an error.
[Issue Phenomenon and Impact]
Upgrade failed with an error and exited.
Currently, apart from display issues, other impacts are unclear.
[Attachments]
Please provide the version information of each component, such as cdc/tikv, which can be obtained by executing cdc version/tikv-server --version.