Restoring the Use of TiUP Components

Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: tiup组件恢复使用

| username: 普罗米修斯

[TiDB Usage Environment] Production Environment
[TiDB Version] v5.2.4
[Encountered Problem: Problem Phenomenon and Impact]
TiUP and a TiKV instance were deployed on the same server, which crashed early this morning. The TiKV instance on that server can be taken offline through the other PD nodes. How can I restore the TiUP component on another node?

| username: 像风一样的男子

Refer to this

| username: TiDB_C罗

Can you still access the operating system at all? If you can, copy the .tiup directory to another node.
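
If the old control machine were still reachable, the copy could look roughly like the sketch below, run on the new control machine. The host `tidb@10.0.1.10` is a placeholder, not a value from this thread, and the `run` helper is a hypothetical dry-run wrapper: it only prints each command unless `RUN=1` is set.

```shell
# Hypothetical address of the old control machine; replace with your own.
OLD_HOST="${OLD_HOST:-tidb@10.0.1.10}"

# Print each command; execute it only when RUN=1 is set.
run() {
  echo "+ $*"
  if [ "${RUN:-0}" = "1" ]; then "$@"; fi
}

copy_tiup_home() {
  # Copy the whole TiUP home (binaries, cluster metadata, SSH keys)
  # from the old machine's home directory into the local one.
  run scp -rp "$OLD_HOST:.tiup" "$HOME/"
  # Check that the copied metadata is visible to TiUP.
  run "$HOME/.tiup/bin/tiup" cluster list
}

copy_tiup_home
```

Cluster metadata normally lives under `.tiup/storage/cluster/clusters/`, so copying the whole directory carries it along with the TiUP binaries.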

| username: 普罗米修斯

No, I can’t access the operating system.

| username: Kongdom

If you can’t get into the operating system at all, the only option is the solution in the first reply. I also recommend backing up TiUP regularly. We currently run two scheduled backups: one for the database and one for TiUP.
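
One simple way to back up TiUP is to archive its home directory on a schedule. The sketch below assumes the default location `~/.tiup`; the backup directory `~/tiup-backups` and the function name are made up for illustration.

```shell
# Archive the TiUP home directory and print the archive path.
# $1: TiUP home (default: ~/.tiup)
# $2: backup directory (default: ~/tiup-backups, an assumed path)
backup_tiup_home() (
  tiup_home="${1:-$HOME/.tiup}"
  backup_dir="${2:-$HOME/tiup-backups}"
  archive="$backup_dir/tiup-home-$(date +%Y%m%d-%H%M%S).tar.gz"

  if [ ! -d "$tiup_home" ]; then
    echo "no such directory: $tiup_home" >&2
    exit 1
  fi
  mkdir -p "$backup_dir"
  # -C stores paths relative to the parent directory, so the archive
  # can be unpacked into any home directory later.
  tar -czf "$archive" -C "$(dirname "$tiup_home")" "$(basename "$tiup_home")" || exit 1
  echo "$archive"
)

# Example: backup_tiup_home "$HOME/.tiup" "$HOME/tiup-backups"
# Restore on a new machine: tar -xzf <archive> -C "$HOME"
```

The archive is only useful if it is stored somewhere other than the control machine itself, for example by copying it to another server after each run.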

| username: zhanggame1

I remember you can use the same YAML deployment file to deploy on other machines. You can test it out.

| username: dba远航

Can’t you just install tiup on the new machine directly?

| username: 普罗米修斯

I found the initial topology.yaml configuration file. How do I redeploy it?

| username: Jolyne

Just redeploy it the same way as before with tiup, and remember to rename the previously deployed instance.
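
As a rough sketch of what redeploying with the existing topology file might involve: the cluster name `tidb-prod` and the deploy user `tidb` below are assumptions, and the version comes from the original post. The renaming step mentioned above is not shown because the thread does not spell it out. The `run` helper is a hypothetical dry-run wrapper that only prints commands unless `RUN=1` is set.

```shell
# Assumed values; replace with your own.
CLUSTER_NAME="${CLUSTER_NAME:-tidb-prod}"   # hypothetical cluster name
TIDB_VERSION="${TIDB_VERSION:-v5.2.4}"      # version from the original post
TOPOLOGY="${TOPOLOGY:-topology.yaml}"
DEPLOY_USER="${DEPLOY_USER:-tidb}"          # hypothetical SSH user

# Print each command; execute it only when RUN=1 is set.
run() {
  echo "+ $*"
  if [ "${RUN:-0}" = "1" ]; then "$@"; fi
}

redeploy_plan() {
  # Pre-check the target hosts against the topology file.
  run tiup cluster check "$TOPOLOGY" --user "$DEPLOY_USER"
  # Register the cluster on the new control machine.
  run tiup cluster deploy "$CLUSTER_NAME" "$TIDB_VERSION" "$TOPOLOGY" --user "$DEPLOY_USER"
  # Inspect what TiUP now knows about the cluster.
  run tiup cluster display "$CLUSTER_NAME"
}

redeploy_plan
```

Reviewing the printed commands before setting `RUN=1` is a cheap safeguard, since the target hosts are still running a production cluster.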

| username: TiDBer_小阿飞

Download an offline package and deploy it on the new node, right?

| username: 普罗米修斯

Hello, after reading the linked post, I have three questions:

  1. In the second step, do we need to mv the startup files on all nodes?
  2. The original Grafana, monitoring, and Alertmanager instances were also on the crashed server. Can the new topology.yaml file deploy them on other servers?
  3. Do we no longer need to include the TiKV node from the crashed server in the configuration file?
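
Regarding the second question, a TiUP topology file names a host for each monitoring component, so pointing them at a surviving server is a matter of changing those entries. The sketch below prints such a fragment; the address `10.0.1.20` is a placeholder, not a host from this thread.

```shell
# Placeholder for a surviving server; replace with a real host.
MONITOR_HOST="${MONITOR_HOST:-10.0.1.20}"

# Print the monitoring sections of a topology file for the given host.
write_monitoring_fragment() {
  cat <<EOF
monitoring_servers:
  - host: $1
grafana_servers:
  - host: $1
alertmanager_servers:
  - host: $1
EOF
}

write_monitoring_fragment "$MONITOR_HOST"
```

The three components do not have to share a host; each section can list a different address.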

| username: 普罗米修斯

So all the startup services of the original instances have to be renamed, right? And since the TiKV node on the crashed server is down, does it no longer need to be listed in the configuration file?

| username: 普罗米修斯

You can directly deploy the new node.

| username: 普罗米修斯

No need to rename the instance

| username: 普罗米修斯

Sure.

| username: heiwandou

Redeploy and then restore the cluster data from the backup.

| username: Kongdom

My understanding is that the failed TiKV node still needs to appear in the configuration file. Once the cluster is deployed with it listed, use the tiup cluster scale-in --force command to forcibly remove the node, so that no stale information about it is left behind.
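
A forced scale-in of the failed node might look like the sketch below. The cluster name `tidb-prod` and the node address `10.0.1.13:20160` are placeholders (20160 is the default TiKV port), and the `run` helper is a hypothetical dry-run wrapper that only prints commands unless `RUN=1` is set.

```shell
# Assumed values; replace with your own.
CLUSTER_NAME="${CLUSTER_NAME:-tidb-prod}"        # hypothetical cluster name
FAILED_TIKV="${FAILED_TIKV:-10.0.1.13:20160}"    # hypothetical host:port of the dead TiKV

# Print each command; execute it only when RUN=1 is set.
run() {
  echo "+ $*"
  if [ "${RUN:-0}" = "1" ]; then "$@"; fi
}

remove_failed_tikv() {
  # Confirm the node ID exactly as TiUP lists it.
  run tiup cluster display "$CLUSTER_NAME"
  # --force removes the node without waiting for it to go offline cleanly.
  run tiup cluster scale-in "$CLUSTER_NAME" --node "$FAILED_TIKV" --force
  # Verify that the node is gone from the topology.
  run tiup cluster display "$CLUSTER_NAME"
}

remove_failed_tikv
```

Because `--force` skips the normal data migration off the node, it is only safe when the remaining TiKV nodes still hold enough replicas of every Region.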

| username: xingzhenxiang

Make a backup every time a change occurs.

| username: TiDBer_gxUpi9Ct

Redeploy
