Log Accumulation Issue

translator_bot June 23, 2024, 5:24pm 1

Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: 日志堆积问题

| username: 健康的腰间盘

[TiDB Usage Environment] Production Environment
[TiDB Version] 7.5
[Reproduction Path] Default installed cluster
[Encountered Issue: Phenomenon and Impact] The TiDB component log on one machine is too large
[Resource Configuration]

The monitoring interface shows excessive disk usage on the first machine, and the cause turned out to be the TiDB logs.
Below are the logs from the problematic server:

[Image: Abnormal Log]

Below are the logs from a normal server:

[Image: Normal Log]

I checked the configuration with: show config where name like '%log.%'
The log.file.max-days parameter is 0 on both machines.
Why have logs accumulated on only the first machine (approximately 70 GB so far)?

translator_bot June 23, 2024, 5:24pm 2

| username: Defined2014 | Original post link

Check what the logs actually contain. There might have been a network issue or something similar during that period that caused a lot of logs to be generated. If you do not need them, you can delete the logs manually.
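Before deleting anything by hand, it can help to list which log files are old and how much space they take. The following is a minimal Python sketch under stated assumptions, not an official TiDB tool: the function name find_old_logs, the 30-day threshold, and the assumption that rotated files end in .log and sit in the same directory as the active tidb.log are all illustrative. It only reports candidates and deletes nothing.

```python
import os
import time


def find_old_logs(log_dir, max_age_days, active_name="tidb.log", now=None):
    """Return (path, size_bytes) for .log files older than max_age_days, oldest first.

    Assumptions (hypothetical, adjust to your deployment):
    - rotated log files end in ".log" and live directly in log_dir
    - the file named active_name is still being written and must be skipped
    """
    now = time.time() if now is None else now
    cutoff = now - max_age_days * 86400  # age threshold as a Unix timestamp
    candidates = []
    for entry in os.scandir(log_dir):
        if not entry.is_file() or not entry.name.endswith(".log"):
            continue
        if entry.name == active_name:
            continue
        info = entry.stat()
        if info.st_mtime < cutoff:
            candidates.append((info.st_mtime, entry.path, info.st_size))
    candidates.sort()  # oldest modification time first
    return [(path, size) for _, path, size in candidates]
```

Calling find_old_logs on the TiDB log directory with max_age_days=30 and summing the returned sizes gives an estimate of how much of the disk usage comes from history you can remove. Review the list first, since the file-name assumptions may not match your cluster.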

translator_bot June 23, 2024, 5:25pm 3

| username: 健康的腰间盘 | Original post link

The logs contain SQL operation information, so it should not be a network issue. They span from March of this year to the present.

translator_bot June 23, 2024, 5:25pm 4

| username: 这里介绍不了我 | Original post link

Do your clients connect to TiDB directly, or are connections distributed through something like LVS?

translator_bot June 23, 2024, 5:25pm 5

| username: 健康的腰间盘 | Original post link

Direct connection

translator_bot June 23, 2024, 5:25pm 6

| username: Defined2014 | Original post link

Then the likely cause is that load is not being balanced properly, so all the traffic is running on one TiDB instance.

translator_bot June 23, 2024, 5:25pm 7

| username: 健康的腰间盘 | Original post link

Yes! Thank you.

translator_bot June 23, 2024, 5:25pm 8

| username: 这里介绍不了我 | Original post link

I recommend adding a load balancer in front of the TiDB instances and distributing connections according to each machine's CPU capacity. Also, manually delete the old log files you no longer need.
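To make "allocate based on CPU capacity" concrete, here is a minimal Python sketch of smooth weighted round-robin, one common way to spread connections in proportion to a weight. It is only an illustration: in a real deployment a load balancer such as LVS would do this work, and the instance names and core counts below are hypothetical.

```python
def smooth_weighted_round_robin(weights, n):
    """Return n picks from weights (name -> positive int), proportional to weight.

    Each round every instance gains its weight; the instance with the highest
    running score is picked and then pays back the total weight. Over any
    multiple of the total weight, each instance is picked exactly
    weight-many times per cycle, and picks are interleaved rather than bursty.
    """
    total = sum(weights.values())
    current = {name: 0 for name in weights}
    picks = []
    for _ in range(n):
        for name, weight in weights.items():
            current[name] += weight
        chosen = max(current, key=current.get)  # ties go to the first name
        current[chosen] -= total
        picks.append(chosen)
    return picks


# Hypothetical TiDB instances, weighted by CPU core count.
tidb_weights = {"tidb-1": 16, "tidb-2": 8, "tidb-3": 8}
schedule = smooth_weighted_round_robin(tidb_weights, 32)
```

With these weights, tidb-1 receives half of the 32 connections and the other two instances a quarter each, instead of a single instance taking all the traffic (and writing all the logs).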

translator_bot July 19, 2024, 8:23am 9

| username: TiDBer_7S8XqKfl-1158 | Original post link

It looks like only one TiDB node is doing any work, while the other nodes have no logs at all. That is definitely not right.

translator_bot July 28, 2024, 12:45am 10

| username: TiDBer_TQXaqJ6U-6236 | Original post link

It looks like everything is running on one host. There might be an issue with load balancing.

translator_bot July 29, 2024, 2:24am 11

| username: TiDBer_7S8XqKfl-1158 | Original post link

Check if it’s a load balancing issue.

translator_bot July 29, 2024, 2:24am 12

| username: TiDBer_3Cusx9uk-0775 | Original post link

What load balancer are you using? It seems like the load balancing has failed.

translator_bot July 29, 2024, 2:24am 13

| username: 濱崎悟空 | Original post link

Create a load balancer.