What is the best duplicate file finder, that preserves my source of truth?

parkercp@alien.top · 1 year ago

speculatrix@alien.top · 1 year ago

Write a simple script which iterates over the files and generates a hash list, with the hash in the first column.

find . -type f -exec md5sum {} ; >> /tmp/foo

Repeat for the backup files.

Then make a third file by concatenating the two, sort that file, and run “uniq -d”. The output will tell you the duplicated files.

You can take the output of uniq and de-duplicate.

jerwong@alien.top · 1 year ago

I think you need a \ in front of the ;

i.e.: find . -type f -exec md5sum {} \; >> /tmp/foo