Skip to content

Find duplicate files on your system using inclusion and exclusion folder lists.

License

Notifications You must be signed in to change notification settings

cwkingjr/find_duplicate_files

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Find Duplicate Files

Just a little CLI app to learn some golang.

It scans the files in the search directory paths, excluding any hidden unix directories and any directory names listed in your exclusion list, takes a sha256 hash of each file, and outputs any files where the sha's matched on one line, inluding the full path to the files.

To Compile

With Go installed (brew install go):

go build find_duplicate_files.go

That will build an executable with the name of find_duplicate_files.

To Run

Help

./find_duplicate_files --help

Usage: find_duplicate_files -searchDirPaths <CSV of dir paths> [-excludedDirNames CSV of dir names]

Example: find_duplicate_files -searchDirPaths /Users/chuck/Documents,/Users/chuck/Desktop -excludedDirNames repos,node_modules

  -excludedDirNames value
    	A comma-seperated list of directory names that may be found in any path.
  -searchDirPaths value
    	A comma-seperated list of full paths to a directory. (Required)

Example

./find_duplicate_files -searchDirPaths /Users/chuck/Documents,/Users/chuck/Desktop -excludedDirNames repos,node_modules

It prints lines of suspected duplicate files with the full path, pipe seperated, to stdout.

The files in each line below appear to be duplicates based upon a sha256 hash of their contents.
/Users/chuck/Documents/King-Charles-MITRE-20090712.odt|/Users/chuck/Documents/fromoldmacbook/King-Charles-MITRE-20090712.odt

About

Find duplicate files on your system using inclusion and exclusion folder lists.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages