Testing Auto Sync Disaster Recovery: Unsafe Remove-Failed-Stores Not Triggered

| username: Gin | Original post link

This is the complete disaster recovery process. For details, you can refer to the disaster recovery manual in the link above.

  1. Force-recover PD from a single replica (with 5 PDs deployed 3:2 across two centers, either of the two PDs in the disaster recovery center can be chosen for recovery; the other PD in the disaster recovery center is abandoned).
  2. Adjust Placement-Rules to convert Learner replicas to Voter replicas, resulting in a 2-replica mode for the recovered cluster.
  3. Disable the DR Auto-Sync feature and switch to the default Majority mode.
  4. Use pd-ctl to remove all TiKV stores of the primary center online.
  5. Use pd-recover to increase the PD allocate-id by 100000000, so that IDs allocated afterwards (region IDs, etc.) do not roll back.
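
To make steps 2 and 5 a bit more concrete, here is a minimal Python sketch. It only prepares the data; it does not talk to PD. The rule objects are hypothetical samples that imitate the JSON shape of PD placement rules (`group_id`, `id`, `role`, `count`), and the helper names `learners_to_voters` and `bumped_alloc_id` are made up for illustration. How you export the rules and read the current allocate-id depends on your cluster and is not covered here.

```python
import copy
import json

# Increment quoted in step 5 of the list above.
ALLOC_ID_BUMP = 100_000_000


def learners_to_voters(rules):
    """Return a copy of the rules in which every learner rule becomes a voter rule.

    The input list is left untouched so the original export can be kept as a backup.
    """
    converted = copy.deepcopy(rules)
    for rule in converted:
        if rule.get("role") == "learner":
            rule["role"] = "voter"
    return converted


def bumped_alloc_id(current_alloc_id, bump=ALLOC_ID_BUMP):
    """Return the allocate-id to set so that newly allocated IDs do not roll back."""
    return current_alloc_id + bump


if __name__ == "__main__":
    # Hypothetical rules for the disaster recovery center (not from a real cluster).
    sample_rules = [
        {"group_id": "pd", "id": "dr-voter", "role": "voter", "count": 1},
        {"group_id": "pd", "id": "dr-learner", "role": "learner", "count": 1},
    ]
    print(json.dumps(learners_to_voters(sample_rules), indent=2))

    # 4000 is a placeholder for the allocate-id found on the surviving PD.
    print(bumped_alloc_id(4000))
```

The converted rules would still have to be saved back with pd-ctl, and the bumped value handed to pd-recover. Rules that target the lost primary center need separate handling, which this sketch does not attempt.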