How to Efficiently Modify the ID Column of a Large Table?

translator_bot · June 21, 2024, 9:32am

Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: 如何高效修改大表id列？

| username: 江湖故人

Is there a good way to modify the originally unordered id column into incremental numbers for a 5kw test table?

translator_bot · June 21, 2024, 9:32am

| username: WalterWj | Original post link

Export the dumpling backup, manually modify the table structure to be auto-increment, and then import it into TiDB using lighting. The table is essentially rebuilt.

translator_bot · June 21, 2024, 9:32am

| username: 江湖故人 | Original post link

Unable to manually clear the data in the id column.

translator_bot · June 21, 2024, 9:32am

| username: zhanggame1 | Original post link

If the data rows are not long for 5000KW, directly create a new table, use auto-increment for the ID, and then insert into select from the old table. The tidb_mem_quota_query can be adjusted to 10G at the session level, which should be sufficient.

translator_bot · June 21, 2024, 9:32am

| username: 小龙虾爱大龙虾 | Original post link

Deleting the ID will clear it out

translator_bot · June 21, 2024, 9:32am

| username: dba远航 | Original post link

First delete the ID column, then try adding the ID column with auto-increment.

translator_bot · June 21, 2024, 9:32am

| username: Kongdom | Original post link

Is it changing the id column to an auto-increment column, or changing the existing id column values to incrementing numbers?

translator_bot · June 21, 2024, 9:32am

| username: zhanggame1 | Original post link

If it is a clustered table, the auto-increment ID must be the primary key column, and the primary key column is not allowed to be modified.

translator_bot · June 21, 2024, 9:32am

| username: Jellybean | Original post link

You can try the following approach:

Delete the id column from the original table.
Use dumpling to export the entire table. Since the id column is removed, the exported data will not contain the id column.
- There is an issue here: currently, deleting primary key columns or columns related to composite indexes is not supported. You need to delete the index first. If it is a clustered index, deletion is also not supported. If deletion is possible, proceed to the next steps.
- If not, you may need to recreate the table and import the data.
Modify the schema file in the exported files to add an auto-increment id column to the new table. Note that you should use MySQL compatibility mode here; otherwise, the obtained id might be unique but not strictly incremental.
- Using MySQL compatibility mode ensures that the ID is unique and monotonically increasing.
- AUTO_INCREMENT | PingCAP 文档中心
Use lightning to import the data.