I need to copy several millions of files from ceph vfs to google cloud storage. I have completed TB of data migration already but now I have found one folder that has at current count over 40 Million files in it. When I tried this with sync and copy it just seems to be pending the operation for an extremely large amount of time which i expected, I let it run for 18hrs and still no copy had started. I am assuming this must be a known problem and was hoping i could find some guildance.
Run the command 'rclone version' and share the full output of the command.
I'm testing rclone now and syncing a directory that contains 32M files to Azure Blob Storage. I'm part way through the initial sync and have about 12M files (blobs) in the destination so far. With 12M blobs in the destination it takes rclone about 2 hours to start copying.
I am using the --fast-list option. Maybe it will help you out, too.
Or maybe it won't. I read some documentation today that said it may not actually be faster in some situations. I am going to try it without the option next time I start the sync and compare the "startup" times.
Im trying this method that was used in this process and it seems solid very similar to my original theory on this but when i try this command with the --no-traverse option i get this error
Transferred: 133.210 KiB / 133.210 KiB, 100%, 126.048 KiB/s, ETA 0s
Errors: 10 (retrying may help)
Elapsed time: 1.4s
2024/01/12 09:29:28 INFO :
Transferred: 133.210 KiB / 133.210 KiB, 100%, 126.048 KiB/s, ETA 0s
Errors: 10 (retrying may help)
Elapsed time: 1.4s
2024/01/12 09:29:28 DEBUG : 25 go routines active
2024/01/12 09:29:28 Failed to copy with 10 errors: last error was: googleapi: Error 400: Cannot insert legacy ACL for an object when uniform bucket-level access is enabled. Read more at Uniform bucket-level access | Cloud Storage | Google Cloud, invalid
here im only testing with a small excised portion of the list, I can't modify the uniform bucket-level access. So now im looking for another option to pass to rclone.
quick question maybe you know. im using the following flags --no-traverse --fast-list --ignore-checksum --transfers 255 --checkers 255 --gcs-bucket-policy
i thought the ignore-checksum would not run the post copy check but mine is still doing that? any ideas why ??