Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.
Original topic: TiDB-server故障排查
Where can I see the logs when TiDB-server encounters an OOM during operation, or which monitoring module in Prometheus can observe it? Is the OOM mostly caused by slow SQL?
The TiDB-server experiencing OOM is mostly caused by slow SQL queries. To determine if it is an OOM issue, you can check here: TiDB OOM Troubleshooting | PingCAP Documentation Center
You can just look at the TiDB logs.
You can check TiDB logs and the system logs of the operating system.
For monitoring, you can look at the overview panel in Grafana.
Most OOM (Out of Memory) issues are caused by slow SQL queries.
It is generally due to an inefficient SQL execution plan, lack of appropriate indexes, outdated statistics, or optimizer bugs.
I mostly know about major things, mainly SQL-related stuff.
Will TiKV also experience OOM (Out of Memory)?
Search for “out of memory” on the dashboard or the machine where tidb-server is deployed.
It is possible. If the capacity parameter is set correctly, TiKV will basically not encounter OOM situations.
Memory is not too small, and it won’t be an issue if not deployed in a mixed environment.
TiDB-server encountered OOM, which in most cases is caused by large SQL queries.
This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.