[TiDB Usage Environment] Production Environment
[TiDB Version] v6.5.1
[Reproduction Path] Operations performed that led to the issue
[Encountered Issue: Phenomenon and Impact]
[Resource Configuration] 3 PD nodes, 5 TiKV nodes, 1 TiFlash node, all with 8 cores and 32GB RAM
[Attachments: Screenshots/Logs/Monitoring]
This is the highest CPU usage on one machine, others are basically below 50%
Open the TiDB Dashboard and check the Slow Queries, TopSQL, and Statement Analysis pages. Especially on the TopSQL page, you can basically identify which SQL statements are consuming the most CPU.
I don’t quite understand where you saw this, but I directly checked the database processlist. Anyway, there are no very long SQL queries, but overall it is still very slow.
However, you can first display the cluster distribution. If the slow SQL queries have already been killed but only one TiKV node still has high CPU usage, it is likely a hotspot issue (high-frequency small queries). You can check the topSQL panel in the TiDB Dashboard, select the TiKV node with high CPU usage, and see which type of SQL is consuming the most CPU.
Is it convenient to collect a full monitoring report with Clinic? Currently, there is too little information, and we are relying on guesswork --》 PingCAP Clinic Quick Start Guide | PingCAP Documentation Center
It’s equivalent to getting an AWR report in Oracle!!!