Using DM for Full Data Import, but No Data Visible

translator_bot · June 22, 2024, 12:46pm

Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: 使用dm进行全量导入，但是没有看到数据

| username: 烂番薯0

Dear experts, I am using DM for full data import, but I don’t see any data in the database. What could be the reason for this?

Data source

Task

This is the status

However, no data is being imported, and the speed is very slow. How can I improve this?

translator_bot · June 22, 2024, 12:46pm

| username: 烂番薯0 | Original post link

Currently, this number of rows is very slow, increasing one by one.

translator_bot · June 22, 2024, 12:46pm

| username: jackerzhou | Original post link

Use “*” directly for the tbl-name parameter, do not add the database name.

translator_bot · June 22, 2024, 12:46pm

| username: 烂番薯0 | Original post link

So should I stop the task now, and then what should I do?

translator_bot · June 22, 2024, 12:46pm

| username: jackerzhou | Original post link

Delete this task and reconfigure it.

translator_bot · June 22, 2024, 12:46pm

| username: 烂番薯0 | Original post link

It seems there is no delete option, expert.

translator_bot · June 22, 2024, 12:46pm

| username: 朵拉大虾 | Original post link

I remember there was a database naming convention, check the official documentation.

translator_bot · June 22, 2024, 12:46pm

| username: 烂番薯0 | Original post link

Buddy, I followed your instructions and made the changes, but it resulted in an error.

translator_bot · June 22, 2024, 12:46pm

| username: CuteRay | Original post link

This error indicates that the table does not exist in your downstream.

translator_bot · June 22, 2024, 12:46pm

| username: TI表弟 | Original post link

Read more documentation.

translator_bot · June 22, 2024, 12:46pm

| username: 烂番薯0 | Original post link

However, I am using the full import mode. Shouldn’t the downstream table be imported from the upstream by DM?

translator_bot · June 22, 2024, 12:46pm

| username: TI表弟 | Original post link

There is no problem with the table names and such because I don’t need to sync all the tables. I listed them one by one.

translator_bot · June 22, 2024, 12:46pm

| username: 烂番薯0 | Original post link

Oh, okay.

translator_bot · June 22, 2024, 12:46pm

| username: TI表弟 | Original post link

The documentation contains examples, and it is written very clearly here. I suggest reading it carefully three times before asking questions: docs-cn/dm/task-configuration-file-full.md at release-6.5 · pingcap/docs-cn · GitHub

translator_bot · June 22, 2024, 12:46pm

| username: 烂番薯0 | Original post link

Okay, thank you, boss.

translator_bot · June 22, 2024, 12:46pm

| username: CuteRay | Original post link

I suggest you stop the task, clear the downstream, and then re-import, especially dm_meta. The configuration file doesn’t seem to have any issues. By the way, please provide the TiDB and DM versions.

translator_bot · June 22, 2024, 12:46pm

| username: CuteRay | Original post link

I suspect this error is due to DM switching to incremental import. Since you previously stopped the task, modified the configuration file, and restarted it, but didn’t clean up the downstream properly, it resulted in skipping table creation and directly executing data import operations. The documentation also mentions that restarting a task requires cleaning up the downstream. When restarting the task, add the --remove-meta configuration.
Data Migration FAQ | PingCAP Documentation Center

translator_bot · June 22, 2024, 12:46pm

| username: 烂番薯0 | Original post link

Hmm… The version is V6.5.0. After I started the downstream database task, no data was generated. Then, I followed the official procedure to modify the file and restarted the cluster, but it reported the above error, saying that there is no table in the downstream…

translator_bot · June 22, 2024, 12:46pm

| username: CuteRay | Original post link

I understand, it’s normal for the downstream to have no data when the task is just created. This is because the synchronization logic of DM is to pull the upstream data into the dm-worker at the moment you start the task, then fully import it into the downstream, and then execute incremental synchronization. When you start the task, you can observe that your dm-worker will have an additional directory, which stores the SQL files of the upstream table structure and data.

translator_bot · June 22, 2024, 12:46pm

| username: 烂番薯0 | Original post link

Oh, okay, thank you, boss.