Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.
Original topic: 监控报警问题
[TiDB Usage Environment] Production Environment
[TiDB Version] 6.5
[Reproduction Path] Modified some user permissions, but it doesn’t seem related
[Encountered Problem: Phenomenon and Impact] Query takes two to three minutes without results, business reports errors, no error information for now. This was found within the past week, abnormal monitoring matches the time of the issue (11:20)
Is everything normal after the restore operation?
Without performing a restore operation, it fixed itself after two minutes.
The most reasonable explanation found so far, but I don’t know how to prove it 
There should be many logs similar to “epoch not match” in the logs of TiKV and TiDB at corresponding time points. This is generally due to the high load on the cluster, causing rapid changes in region information, which leads to constantly obtaining outdated region information.
It seems so. The high load state lasted for about 40 minutes, but TiDB resumed normal queries after being stuck for two or three minutes.
The load has increased recently.