Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.
Original topic: GC正常但还是显示“含已删除或覆盖但未 GC 的版本”数太多 (GC is normal, but the count of “deleted or overwritten versions not yet GC’d” is still too high)
[TiDB Usage Environment] Production
[TiDB Version] 5.2.2
[Encountered Problem] Simple queries are very slow
[Reproduction Path]
[Problem Phenomenon and Impact]
[Attachments]
This kind of problem can only be avoided by preventing full table scans. I shared an article about this before.
Running select count(1) from table t is very fast, only 0.x seconds, while the conditional query shown in the screenshot above takes around 5 seconds.
Could you please share the article again? I couldn’t find it.
“select count(1) from t” directly uses the primary key index and will not perform a full table scan.
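(As a minimal way to check this, assuming the table is the t mentioned above: EXPLAIN shows which operator TiDB actually chose for the count, and EXPLAIN ANALYZE additionally reports the scan details, including key_skipped_count.)

-- Show the plan for the count; it is usually an IndexFullScan pushed
-- down to TiKV, or a TableFullScan if no suitable index exists.
EXPLAIN SELECT COUNT(1) FROM t;

-- Also run the statement and report per-operator execution info;
-- scan_detail includes key_skipped_count, i.e. how many MVCC versions
-- were read and skipped. Running the slow conditional query this way
-- shows where the 5 seconds go.
EXPLAIN ANALYZE SELECT COUNT(1) FROM t;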
At an offline event in Wuhan.
I saw the expert’s article; it mentioned avoiding full table scans and GC. Right now my entire table has only 18,000 rows and GC is running normally, so I just don’t understand why there are still so many expired keys. Also, when I run the same statement again, the execution plan is identical, but on the runs that finish quickly there is no key_skipped_count.
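(A quick way to double-check that GC really is advancing is the GC bookkeeping in mysql.tidb, shown below. Note that GC only removes the old MVCC versions logically; the deleted keys can remain as RocksDB tombstones until a compaction runs, which is one reason key_skipped_count can stay high even when GC looks normal.)

-- GC status kept by TiDB: tikv_gc_last_run_time and tikv_gc_safe_point
-- show whether GC has actually run and how far the safe point has moved.
SELECT VARIABLE_NAME, VARIABLE_VALUE
FROM mysql.tidb
WHERE VARIABLE_NAME LIKE 'tikv_gc%';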
It may be this known issue: even though GC has run, the conditions for cleaning these versions up automatically have not been met, so a manual compaction is required.
The pitfalls have been pointed out to you; make sure to avoid them in time.
I see that the regions of this table are distributed across all the TiKV and TiFlash nodes… Do I need to run this tikv-ctl operation on all 10+ machines?
Yes, I see it. Thank you! Do I have to use tikv-ctl to execute commands on each TiKV node? Is there a quicker way to handle this? This table is frequently truncated.
You can also operate at the cluster level: manually compact the data of the entire TiKV cluster. The compact-cluster command manually compacts the whole TiKV cluster, and the meaning and usage of its parameters are the same as those of the compact command.
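(A minimal sketch of the cluster-wide compaction; the PD address is a placeholder, and the exact flags should be verified against tikv-ctl compact-cluster --help for your version before running it in production.)

# Compact the write CF of the kv RocksDB on every TiKV in the cluster.
# --bottommost force also rewrites the bottommost level, so the delete
# tombstones left behind after GC are actually dropped.
tikv-ctl --pd 127.0.0.1:2379 compact-cluster -d kv -c write --bottommost force

If a more gradual rollout is preferred, the plain compact command accepts the same parameters and can be run against the TiKV nodes one at a time.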
What is the impact of compacting the entire cluster on the business, and what should be noted?
I don’t know how big the risk is and how long the execution time will be; the cluster is very large.
You can reduce the concurrency, and it’s best to test it first with UAT resources; that would be the most reliable approach.
May I ask if the issue has been resolved? Was it resolved through manual compaction? Please share, thanks.
For now we will work around the bug and wait for a fix. The cluster is too large and the risk is hard to control, so I did not perform the operation.