Source and destination as single files errors in rclone check

dr.mcgillicuddy · January 19, 2020, 7:34am

Hopefully I'm missing something, and I can just delete this post and pretend it never happened. But I uploaded a file, and now want to check my local version against the uploaded version.

I changed the name after uploading, but checking by size or MD5 should still be okay. Except... I don't even get that far.

Here's what I'm doing (there are no line breaks):

rclone copy ~/projects/moduleone.py encryptedremote:projects/old/project1module1.py

then

rclone check ~/projects/moduleone.py encryptedremote:projects/old/project1module1.py

gives

2020/01/19 04:20:00 Failed to create file system for "encryptedremote:projects/old/project1module1.py": is a file not a directory

I am aware that it is not a directory. But rclone check is supposed to work on single files. Yes?
What say ye?

Animosity022 · January 19, 2020, 2:17pm

You can't use rclone check with an encrypted remote, you have to use crypthcheck.

If you want to move a single file, you'd use copyto as copy is expecting a folder/directory:

felix@gemini:~$ rclone copy /etc/hosts gcrypt:hosts
felix@gemini:~$ rclone lsf gcrypt:hosts
hosts
felix@gemini:~$ rclone lsd gcrypt:
          -1 2018-06-17 10:24:02        -1 Movies
          -1 2017-04-18 16:14:26        -1 TV
          -1 2020-01-19 09:14:42        -1 hosts
felix@gemini:~$ rclone lsl gcrypt:hosts
      227 2019-11-29 12:21:40.797000000 hosts

So you'd use:

rclone copyto hosts gcrypt:hosts

ncw · January 19, 2020, 2:18pm

You should find this works

rclone check ~/projects/moduleone.py encryptedremote:projects/old/

Explanation:

All the rclone commands like sync, copy, move, check work with two directories.

However, as a special case, if the first argument points to a file, then rclone adds in a filter just pointing to that file.

So

rclone check ~/projects/moduleone.py encryptedremote:projects/old/

is equivalent to

rclone check ~/projects/ --include "/moduleone.py" encryptedremote:projects/old/

Make sense?

Rclone has copyto and moveto whose src and dst can be either (dir, dir) or (file, file). There is no checkto though.

dr.mcgillicuddy · January 19, 2020, 8:49pm

@ncw
Yes--this makes perfect sense.
But it leads to a different set of issues, because the file has been renamed at the destination, so the "implied" filter (if I can call it that?) causes different errors, as filtering the destination directory returns zero matching files--and rightly so.

rclone check ~/projects/moduleone.py encryptedremote:projects/old/
2020-01-19 14:28:46 NOTICE: Relocated Items: Can't follow symlink without -L/--copy-links
2020-01-19 14:28:46 ERROR : moduleone.py: File not in Encrypted drive 'encryptedremote:projects/old/'
2020-01-19 14:28:46 NOTICE: Encrypted drive 'encryptedremote:projects/old/': 1 files missing
2020-01-19 14:28:46 NOTICE: Encrypted drive 'encryptedremote:projects/old/': 1 differences found

I do not understand the first part of this, because there are no symlinks anywhere. My local file is not a symlink, and the destination is google drive.

So, it sounds like there actually may not be a way to run rclone check against two files with different names? Or even rclone cryptcheck in such a way as to check only hashes, ignoring names entirely?

@Animosity022

You can't use rclone check with an encrypted remote, you have to use cryptcheck

Indeed, you can use rclone check, but it will not do a checksum, just a "quick check" i.e. size

If you want to move a single file, you'd use copyto as copy is expecting a folder/directory:

That's simply not true. rclone copy will work on a file or a directory. copyto would have allowed me to change the name of the file at the time of the copy, so I can see the confusion, but I changed the name after copying so I used copy instead. I could have been more clear on that, I apologize.

Animosity022 · January 19, 2020, 9:38pm

Nah, you misunderstand what that is referring to. The destination is expecting a directory and your command above:

Would have made a directory called "project1module1.py" with a file in that directory.

If you wanted to run your exact command above, using copyto instead of copy works while your command does not.

There isn't much point of having a checksummed remote and doing a size check. While it's possible, it really doesn't make sense. The recommended approach is to use cryptcheck as that is why it was created.

dr.mcgillicuddy · January 19, 2020, 10:03pm

@Animosity022
You're right--I apologize--I was trying to simplify because I did rename the file, but I did it via rclone mount as part of an attempt to organize all of my stuff... Outcome TBD.

ncw · January 20, 2020, 10:30am

There is probably a symlink in the ~/projects directory. See above for why rclone is scanning it!

I don't think there is at the moment, no, you'd need the rclone checkfile variant (which doesn't exist yet!)

It sounds like you'd like to be comparing the hashes of the encrypted data.

Rclone would need some new flags to hashsum

rclone hashsum md5 --crypted cryptdrive:
rclone hashsum md5 --cryptkey cryptdrive: /path/to/local

--crypted would show the crypted hashes from a crypt drive (they would normally be blank)

--cryptkey cryptedremote: would use cryptedremote: to encrypt the files before printing their encrypted hashes.

This would then enable you to do a manual compare of encrypted md5sums.

This is essentially what rclone cryptcheck does under the hood.

Would something like that be useful?

dr.mcgillicuddy · January 21, 2020, 12:54am

Would something like that be useful?

I can think of a couple of situations in which that would be useful:

Checking the integrity of a single file that has been renamed either during or after upload, as essentially the equivalent of rclone check to use after rclone copyto
To find duplicates of a single file across multiple directories in a crypt remote prior to uploading (e.g. if I want to see whether ~/movie.mp4 is anywhere in a mount, regardless of name, I could say rclone checkfile ~/movie.mp4 cryptdrive: and rclone would locate duplicates, even if they were buried somewhere and called movie2.mp4)

#2 might be sort of an outlier case. But rclone copy has rclone check so it seems consistent to have an rclone check equivalent for rclone copyto which we currently do not have on a single-file basis--only for directories.

It occurred to me that if I could just run rclone hashsum on each individual file, I could pipe both outputs to diff and achieve the result I'm looking for. But I think we don't currently have anything that will hash a single file on a crypted remote? hashsum and md5sum return errors, and cryptcheck does not give output in a way that I could pipe.

ncw · January 21, 2020, 8:21am

This would be a very expensive operation as it would read the nonces from all the remote files and encrypt and then hash all your local files...

Perhaps adding another flag to rclone check rclone cryptcheck maybe --file or something like that to make it think both the arguments are files might be simplest?

That is essentially what I was proposing with the --crypted and --cryptkey flags above, so you could read hashes from crypts and the local file system but encrypted like this crypt. This would work for single files.

dr.mcgillicuddy · January 21, 2020, 9:06am

This seems straightforward because it would essentially mean "take the full source and destination paths exactly as they are each written". That's a simple behavior to explain to users.

This would be a very expensive operation as it would read the nonces from all the remote files and encrypt and then hash all your local files...

Wouldn't it just read the nonces, and encrypt and hash a single local file over and over? Still expensive. But the same number of hash operations as checking full directories against eachother. (a directory of 20 files, and a remote of 20 files is 20 hash operations with cryptcheck; 1 local file checked against a directory of 20 files is 20 hash operations with checkfile).

Either way, de-duping your rclone backups is probably something another utility can do, so I withdraw that example. I was just imagining what such a command might do if one ran it with a source file and a destination directory, instead of a file on each end...

ncw · January 21, 2020, 10:16am

Do you want to make a new issue on github about that? It should be relatively straight forward I think.

I guess it might!

So you think it is worth implemeting the --crypted and --cryptkey flags to hashsum? Or is that too esoteric do you think?

dr.mcgillicuddy · January 22, 2020, 1:50am

I think it will get less use if it is implemented as a flag to hashsum.

I almost always run check or cryptcheck after a copy operation, just out of an abundance of caution.
Having a checkfile operation seems logical if we already have what could perhaps be considered checkdir and, accordingly, a good way for users to keep tabs on their data without having to think too much.

This is your baby, and development direction should be what makes sense to you, first and foremost.

ncw · January 22, 2020, 2:10pm

OK let's have a go at a --file flag to check and cryptcheck.

I prefer the --file flag rather than proliferating checkfile and cryptcheckfile sub commands!

Can you please make a new issue on github about that? Can you put a link to this page in it please!

Thanks

dr.mcgillicuddy · January 23, 2020, 4:00am

Made. #3897

You asked me that twice, and I neglected to do so the first time. I apologize.

ncw · January 23, 2020, 10:09am

Thank you for making the issue

system · April 22, 2020, 10:16am

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.