This is a general architecture question. When using rclone copy between two cloud providers AWS S3 and Azure ADLS, does the data being moved ever hit the local disk of the machine where rclone runs or is it all in memory? What are the steps that rclone takes for a single object? Also in terms of scaling, is it only vertical or can you run rclone in a distributed fashion across multiple instances?
rclone compares the source object to the dest object.
by default, if the source modtime is newer or the size has changed,
then rclone copies the source object to the dest.
in your case, both providers support MD5 hash, using--checksum
rclone can use that to compare source to dest.
yes, can run as many instances as you want.
by default, rclone has a config file.
tho it is possible not to use a config file and create the remote on the fly.
Does that mean you can configure Rclone to use multiple instances when copying a single container? In other words, I want to increase parallelism by scaling horizontally when copying data from the same container. So state somehow needs to be kept to know which instance is working on which part of the container content. Can you link me to docs on how to set up something like that?