Find Duplicate Files Script
February 19, 2014
This is the script I use to find duplicate files. This is particularly useful, for example, if you have a bunch of random pictures and want to find the duplicates. However, it will find any duplicate files under the current working directory.
#!/bin/sh find -not -empty -type f -printf "%s\n" | sort -rn | uniq -d | xargs -I{} -n1 find -type f -size {}c -print0 | xargs -0 md5sum | sort | uniq -w32 --all-repeated=separate
More examples can be found here.
1 Comment