The Chinese data in our existing MySQL database is stored with Latin1 encoding, while TiDB uses UTF8, so migrating from MySQL to TiDB with DM produces garbled Chinese characters. Can a DM migration task be configured to export using the Latin1 character set? Previously, when running dumpling directly, the following command exported the Chinese text correctly: /dumpling -h 127.0.0.1 -P 13306 -u root -p *#^ippbx^#* -t 16 -F 256MB -B core -o /home/tidb_cjx/tidb-community-toolkit-v7.5.0-linux-amd64/cjx --params "character_set_client=latin1,character_set_connection=latin1,character_set_results=Latin1,character_set_server=Latin1"
As far as I can tell, it is not yet possible to pass parameters to DM's export phase. The official documentation says export parameters can be specified, but when I tested this on a 6.x version, adding --params had no effect. You could try 7.5 to see whether it works now. Alternatively, you can export and import the full data manually, and use the target-database.session parameter mentioned above for the incremental data.
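For the manual full export suggested above, here is a minimal sketch that assembles the dumpling command line from the original question in Python. The function name build_dumpling_argv, the ./dumpling path, the password, and the output directory are hypothetical placeholders; the flags themselves are only the ones that appear in the question. Passing the arguments as a list avoids any shell interpretation of characters such as * and # in the password.

```python
import shlex


def build_dumpling_argv(binary, host, port, user, password, database,
                        output_dir, charset="latin1"):
    """Assemble a dumpling argv that forces the session character set.

    Hypothetical helper: it only mirrors the flags used in the question.
    """
    session_vars = {
        "character_set_client": charset,
        "character_set_connection": charset,
        "character_set_results": charset,
        "character_set_server": charset,
    }
    # --params takes a comma-separated list of name=value pairs.
    params = ",".join(f"{name}={value}" for name, value in session_vars.items())
    return [
        binary,
        "-h", host, "-P", str(port), "-u", user, "-p", password,
        "-t", "16", "-F", "256MB", "-B", database, "-o", output_dir,
        "--params", params,
    ]


# Placeholder values; substitute the real binary path, credentials, and directory.
argv = build_dumpling_argv("./dumpling", "127.0.0.1", 13306, "root",
                           "example-password", "core", "/tmp/dump_out")

# Print a shell-quoted form for inspection; nothing is executed here.
print(shlex.join(argv))
```

To actually run the export, the list can be handed to subprocess.run(argv, check=True).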
Where does DM store the full export data? The default appears to be ./dumped_data, but I did not see this folder created during the migration.
Automatic character set conversion has to be supported by the migration tool itself; otherwise the text ends up garbled. All that can be said is that DM still needs improvement in this area.
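To illustrate the garbling discussed in this thread, here is a small sketch in Python. It assumes the common situation where the application wrote UTF-8 bytes into a latin1 column, which the thread does not state explicitly. Python's "latin-1" codec (ISO-8859-1) stands in for MySQL's latin1, which is closer to cp1252, so the exact garbled characters may differ on a real server.

```python
original = "中文"

# What is physically stored: UTF-8 bytes written into a latin1 column.
stored_bytes = original.encode("utf-8")

# Export over a utf8 connection: each stored byte is treated as one latin1
# character and converted, so every byte turns into its own character.
garbled = stored_bytes.decode("latin-1")

# Export over a latin1 connection: no conversion happens, the bytes pass
# through unchanged, and reading them as UTF-8 recovers the text.
passthrough = stored_bytes.decode("utf-8")

# The garbling is reversible as long as no byte was lost along the way.
repaired = garbled.encode("latin-1").decode("utf-8")

print(len(original), len(garbled))
print(passthrough == original, repaired == original)
```

This is why forcing character_set_client, character_set_connection, and character_set_results to latin1 during the export, as in the dumpling command above, keeps the Chinese text intact.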