Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.
Original topic: TiDB 6.1.0 版本 TiFlash 异常重启 (TiFlash restarts abnormally on TiDB 6.1.0)
We upgraded the production cluster to 6.1.0 the day before and enabled dynamic pruning mode. During that night's batch run, all TiFlash nodes reported memory allocation errors and the servers restarted. After the restart, we disabled dynamic pruning mode. TiKV and TiFlash are co-deployed on the same hosts. Today there were no memory allocation errors, but the error below occurred instead. Only one TiFlash node was affected; it did not have NUMA configured, whereas the other co-deployed nodes did.
【TiDB Version】
6.1.0
【Issue Encountered】
During the nightly batch run, TiFlash repeatedly restarted with the following error:
tiflash_stderr.log:
Logging debug to /data01/deploy/log/tiflash.log
Logging errors to /data01/deploy/log/tiflash_error.log
deprecated configuration, log-file has been moved to log.file.filename
override log.file.filename with log-file, "/data01/deploy/log/tiflash_tikv.log"
libc++abi: terminate_handler unexpectedly threw an exception
Logging debug to /data01/deploy/log/tiflash.log
Logging errors to /data01/deploy/log/tiflash_error.log
deprecated configuration, log-file has been moved to log.file.filename
override log.file.filename with log-file, "/data01/deploy/log/tiflash_tikv.log"
tiflash.log:
【Solution】
After forcibly scaling in the problematic node and then scaling it back out, the error did not recur. NUMA binding was also added.
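
To check whether a node is still restarting after a change like this, one option is to count startup and crash markers in tiflash_stderr.log. The following is only a minimal sketch in Python, not an official tool. The function name `summarize_stderr` is hypothetical, and it assumes that each "Logging debug to" line corresponds to one process start, which is inferred from the excerpt above rather than documented behavior.

```python
import sys

# Marker strings copied from the tiflash_stderr.log excerpt in this post.
# Assumption: one "Logging debug to" line is printed per process start.
STARTUP_MARKER = "Logging debug to"
CRASH_MARKER = "terminate_handler unexpectedly threw an exception"


def summarize_stderr(lines):
    """Count startup lines and terminate_handler crash lines."""
    startups = 0
    crashes = 0
    for line in lines:
        if STARTUP_MARKER in line:
            startups += 1
        if CRASH_MARKER in line:
            crashes += 1
    return {"startups": startups, "crashes": crashes}


if __name__ == "__main__":
    # Usage: python summarize_stderr.py /data01/deploy/log/tiflash_stderr.log
    with open(sys.argv[1], encoding="utf-8", errors="replace") as f:
        print(summarize_stderr(f))
```

If the startup count keeps growing between two runs of the script, the node is probably still restarting. The script only counts lines, so it cannot tell why a restart happened.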