The value of tidb_tikvclient_backoff_seconds_count is nearly ten thousand

translator_bot · June 22, 2024, 9:59pm

Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: tidb_tikvclient_backoff_seconds_count value 近万

| username: leopardxu

[TiDB Usage Environment] Production Environment
[TiDB Version] v6.1.2
[Reproduction Path] Operations performed that led to the issue
[Encountered Issue: Issue Phenomenon and Impact]
[Resource Configuration]
[Attachments: Screenshots/Logs/Monitoring]

translator_bot · June 22, 2024, 9:59pm

| username: Billmay表妹 | Original post link

What is the question you would like to ask?

translator_bot · June 22, 2024, 9:59pm

| username: leopardxu | Original post link

Can you see what the problem is? I’ve set the threshold to 3000, but it still keeps alerting. My V5.2.1 cluster doesn’t alert with a threshold of 100.

translator_bot · June 22, 2024, 9:59pm

| username: Lucien-卢西恩 | Original post link

Could you share the alert information? It might be due to other tasks pending or an issue with the monitoring data.

translator_bot · June 22, 2024, 9:59pm

| username: 特雷西-迈克-格雷迪 | Original post link

We are using version 6.1, and it reports about ten issues a day.

translator_bot · June 22, 2024, 9:59pm

| username: tidb菜鸟一只 | Original post link

Check the backoff error logs in the logs to see what the errors are.

translator_bot · June 22, 2024, 9:59pm

| username: leopardxu | Original post link

The image you provided is not accessible. Please provide the text content you need translated.

translator_bot · June 22, 2024, 9:59pm

| username: Lucien-卢西恩 | Original post link

From the provided monitoring alert screenshot, the total amount has reached the w level, but sometimes there are different types of alerts. We need to confirm whether the actual monitoring has such a high value. The monitoring screenshot you provided covers 10 hours, but the monitoring data statistics should be for more than just 10 hours; it should be for the entire period. What is the current impact? Also, looking at the bottom screenshot, it seems to be monitoring information rather than alert information. How is this strategy designed?

translator_bot · June 22, 2024, 9:59pm

| username: tidb菜鸟一只 | Original post link

Did you set the alert threshold incorrectly? Shouldn’t it be the max value instead of the total value?

translator_bot · June 22, 2024, 9:59pm

| username: jansu-dev | Original post link

You need to issue the corresponding alert expression or alert information to conduct a targeted analysis.

translator_bot · June 22, 2024, 9:59pm

| username: system | Original post link

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.