Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.
Original topic: tiup组件恢复使用 (Restoring the tiup component for use)
[TiDB Usage Environment] Production Environment
[TiDB Version] v5.2.4
[Encountered Problem: Problem Phenomenon and Impact]
TiUP and a TiKV node are deployed on the same server, and that server crashed in the early morning. The TiKV instance on it can be taken offline through the other PD nodes. Could you please advise how to restore the TiUP component on another node?
Can’t you even access the operating system? If you can, copy the .tiup directory to other nodes.
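For reference, if the operating system on the old control machine were still reachable, that copy might look something like the following sketch (the tidb user and the new host name are assumptions):

```bash
# Assumed example: copy the entire TiUP home (binaries, mirror cache, and the
# cluster metadata under ~/.tiup/storage/cluster) to the new control machine.
rsync -avz ~/.tiup/ tidb@new-control-host:~/.tiup/

# On the new machine, put tiup on the PATH and verify the restored metadata.
echo 'export PATH=$HOME/.tiup/bin:$PATH' >> ~/.bash_profile
source ~/.bash_profile
tiup cluster display <cluster-name>
```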
Can’t access the operating system.
If you can’t even get into the operating system, you can only go with the solution in the first post. Additionally, I recommend backing up tiup regularly; we currently have two scheduled backups: one for the database and one for tiup.
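For what it's worth, a minimal sketch of that kind of scheduled tiup backup, assuming a /data/backup/tiup target directory and a 7-day retention (these paths and the schedule are assumptions, not the poster's actual setup):

```bash
#!/bin/bash
# Assumed example: nightly tarball of the TiUP home (~/.tiup), keeping 7 days.
# Schedule it on the control machine, e.g. via `crontab -e`:
#   0 2 * * * /home/tidb/scripts/backup_tiup.sh
set -euo pipefail
BACKUP_DIR=/data/backup/tiup          # assumed backup location
mkdir -p "$BACKUP_DIR"
tar -czf "$BACKUP_DIR/tiup-$(date +%F).tar.gz" -C "$HOME" .tiup
# Drop archives older than 7 days.
find "$BACKUP_DIR" -name 'tiup-*.tar.gz' -mtime +7 -delete
```

Restoring on a new control machine is then just a matter of extracting the latest archive back into the home directory.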
I remember you can use the same YAML deployment file to deploy on other machines. You can test it out.
Can’t you just install tiup on the new machine directly?
I found the initial topology.yaml configuration file. How do I redeploy it?
Just redeploy it the same way as before with tiup, and remember to rename the previously deployed instance.
Download an offline package and deploy it on the new node, right?
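A rough sketch of that offline-package route, assuming the v5.2.4 community package and the saved topology.yaml (the package file name, SSH user, and cluster name below are assumptions); as suggested above, verify in a test environment that a fresh deploy over the existing instances behaves as expected:

```bash
# Assumed example: install TiUP from the offline community package on the new node.
tar -xzf tidb-community-server-v5.2.4-linux-amd64.tar.gz
sh tidb-community-server-v5.2.4-linux-amd64/local_install.sh
source ~/.bash_profile

# Re-run the deployment against the saved topology file, then check the result.
tiup cluster deploy <cluster-name> v5.2.4 ./topology.yaml --user tidb -p
tiup cluster display <cluster-name>
```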
Hello, after reading that post, I have three questions I'd like to confirm:
- In the second step, do we need to mv the startup files on all of the nodes?
- The original Grafana, monitoring, and Alertmanager were also on the crashed server. Can the new topology.yaml specify that they be deployed on other servers? (See the sketch after this list.)
- For the TiKV node on the crashed server, can we simply leave it out of the configuration file?
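On the second question, a hedged sketch of what relocating those components in topology.yaml might look like (the host address is an assumption):

```bash
# Assumed example: point the monitoring components at a surviving server
# instead of the crashed one in the new topology.yaml.
cat >> topology.yaml <<'EOF'
monitoring_servers:
  - host: 10.0.1.21        # assumed new host for Prometheus
grafana_servers:
  - host: 10.0.1.21
alertmanager_servers:
  - host: 10.0.1.21
EOF
```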
So all of the original instance's startup services have been renamed, right? And since the TiKV node on that server is down, does that mean it no longer needs to be written into the configuration file?
You can directly deploy the new node.
No need to rename the instance.
Redeploy and then restore the cluster data from the backup.
My understanding is that the failed TiKV node still needs to appear in the configuration file. After configuring it, use the tiup cluster scale-in --force command to forcibly scale in and remove this node, so no stale cached information is left behind.
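For illustration, that forced scale-in might look like this (the cluster name and node address are assumptions):

```bash
# Assumed example: forcibly remove the TiKV instance on the crashed server
# so no stale cached metadata is left behind, then confirm it is gone.
tiup cluster scale-in <cluster-name> --node 10.0.1.11:20160 --force
tiup cluster display <cluster-name>
```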
Make a backup every time a change occurs.