How TiDB Generates Large Amounts of Test Data

This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: TiDB 如何生成大量测试数据

| username: KaiNiao

[TiDB Usage Environment] Testing
[TiDB Version] 6.5.0
[Reproduction Path]
[Encountered Problem: Problem Phenomenon and Impact]

I used sysbench to create a table with 300 million rows of data, but it only used less than 9G of disk space, while the disk space is 500G.

How can I quickly generate a large amount of data to fully utilize server resources? This seems a bit wasteful.

For example, does the official source have similar template datasets for import testing?

The goal is to test multi-table join queries, backup tools, data migration, etc. Experts can also recommend some testing functionalities.

[Resource Configuration]
[Attachment: Screenshot/Log/Monitoring]


| username: h5n1 | Original post link

Check with tiup bench.

| username: KaiNiao | Original post link

Okay, I’ll give it a try.

| username: Kongdom | Original post link

The official guide for testing cluster performance can be found here: 如何用 Sysbench 测试 TiDB | PingCAP 文档中心

| username: liuis | Original post link

It seems like the official documentation has it.

| username: KaiNiao | Original post link

Thank you, boss.

| username: system | Original post link

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.