FileDeduplicator

Identifies duplicate files across directories to avoid redundant backups and copies.

Project Vision

Scan directories for duplicate files, with configurable comparison that can ignore certain metadata (e.g., EXIF in images, ID3 tags in audio). Internally extensible to support specialized comparison logic for different file types at the cost of additional complexity and performance overhead.

Architecture

A Scanner collects file candidates, groups by size, then hashes size-matched files.
Comparers handle type-specific equivalence (binary, image, audio), with the most basic being full file hash comparison.
Identifiers determine whether a file is a type a given comparer can handle.

Commands

`find-duplicates`

Primary command. Scans one or more directories for duplicate files with an interactive results browser.

deduper find-duplicates --path /path/to/scan
deduper find-duplicates --path /path1 --path /path2 --min-size 500MB
deduper find-duplicates --path /path/to/scan --allow-metadata-diffs
deduper find-duplicates --path /path/to/scan --exclude /path/to/scan/skip-this

Options:

-p|--path — Directories to scan (repeatable, defaults to current directory)
-x|--exclude — Subdirectories to skip (repeatable)
-s|--min-size — Minimum file size filter, supports suffixes: KB, MB, GB, TB
-m|--allow-metadata-diffs — Ignore metadata differences (ID3 tags, EXIF data) when comparing

Results browser features:

Paged list of duplicate groups, sorted by size (default) or path
Toggle sort order between size and path
Drill into a group to see matched files with filename, size, and directory
Open a file's location in Finder/Explorer
Refresh a match group to re-verify files (removes missing/changed files, drops the group if no duplicates remain)

`compare`

Compare two specific files for size and hash match.

deduper compare ./file1.txt ./file2.txt

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
.github/workflows		.github/workflows
.vscode		.vscode
FileDeduplicator.Common		FileDeduplicator.Common
FileDeduplicator.Tests		FileDeduplicator.Tests
FileDeduplicator		FileDeduplicator
.gitignore		.gitignore
FileDeduplicator.sln		FileDeduplicator.sln
LICENSE.txt		LICENSE.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FileDeduplicator

Project Vision

Architecture

Commands

`find-duplicates`

`compare`

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

FileDeduplicator

Project Vision

Architecture

Commands

find-duplicates

compare

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`find-duplicates`

`compare`

Packages