Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.Original topic: dumpling导出数据出现数据多余的错误数据
[TiDB Usage Environment] Production Environment
[TiDB Version] 5.4.1
[Encountered Problem: Phenomenon and Impact]
When using dumpling to export data based on conditions, erroneous extra data appeared.
/tidb-toolkit-v5.4.1-linux-amd64/bin/dumpling -h 192.168.196.xxx -P 4000 -u root -t 4 -r 200000 -p 'xxxx -T xxx.brc_co_code_data --filetype sql -o /databak/table_bak/2022121302 --where “(PRD_TIME < ‘2019-01-01 00:00:00’)”
When importing data, a primary key conflict occurred. Upon checking the backup file, it was found that the primary key column value 00600180414316264080 appeared twice, and the entire row values were different, as shown in the screenshot. Querying the table records revealed that there was only one actual record.
[Attachment: Screenshot/Log/Monitoring]