Duplicate file checker

Name

dup - Duplicate file checker, using MD5 message digests to identify duplicated files.

Synopsis

dup [OPTIONS] INDEX COMMAND FILE ...

Description

Duplicate file checker, using MD5 message digests to identify duplicated files.

The name of the INDEX must be specified. A new index is created if it does not already exist. Note that the whole index is read into memory for each operation, so combine operations on multiple files into a single call where possible. No performance tests have been run, so there is no advice on how large the index can realistically be. It is currently used to index a collection of 10,000 files.

Commands are:

Options are:

Remarks:

Implemented by dup.cpp.

Format of the Index File

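A very simple text format is used for the index file. Each entry is an MD5 digest (32 hexadecimal characters) on one line, followed by the file name on the next line, with a blank line between entries: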
13fe625700d47a6f9ab20a47de5a22ea
dup.cpp

1ebe001b770e8b4d06439e0b4564a667
test_md5.cpp
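
The layout above can be read and written with a few lines of C++. The following is a minimal sketch, assuming the file contains nothing but digest/name pairs separated by blank lines; the names `Index`, `load_index`, `save_index` and `round_trip_example` are hypothetical and are not taken from dup.cpp.

```cpp
#include <cassert>
#include <istream>
#include <map>
#include <ostream>
#include <sstream>
#include <string>

// Hypothetical in-memory index: MD5 hex digest -> file name.
// A multimap lets several files share one digest.
using Index = std::multimap<std::string, std::string>;

// Reads entries laid out as: digest line, file name line, blank line.
// Blank lines are skipped, so a missing final separator is tolerated.
Index load_index(std::istream& in) {
    Index index;
    std::string digest;
    std::string name;
    while (std::getline(in, digest)) {
        if (digest.empty()) continue;        // separator between entries
        if (!std::getline(in, name)) break;  // truncated entry: ignore it
        index.emplace(digest, name);
    }
    return index;
}

// Writes the index back out in the same layout.
void save_index(std::ostream& out, const Index& index) {
    for (const auto& entry : index) {
        out << entry.first << '\n' << entry.second << "\n\n";
    }
}

// Usage sketch: parse the two sample entries and write them back out.
void round_trip_example() {
    std::istringstream in(
        "13fe625700d47a6f9ab20a47de5a22ea\ndup.cpp\n\n"
        "1ebe001b770e8b4d06439e0b4564a667\ntest_md5.cpp\n");
    const Index index = load_index(in);
    assert(index.size() == 2);
    std::ostringstream out;
    save_index(out, index);  // entries come out sorted by digest
}
```

Keying the map by digest means that looking up a file's duplicates is a single search, at the cost of holding the whole index in memory for every call.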

Example of Use

$ ./dup index add *.cpp
Unable to read the digest index, will create one
Added dup.cpp
Added test_md5.cpp

$ cp dup.cpp dup.dup

$ ./dup index find dup.dup
Found duplicate:
   *dup.dup
    dup.cpp

$ ./dup index remove *.cpp
Removed dup.cpp
Removed test_md5.cpp
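
The find and remove steps in the session above can be pictured as lookups on an in-memory index. This is only a sketch under stated assumptions: the digest of the queried file is passed in as a string (computing the MD5 itself is left out), and `Index`, `find_duplicates`, `remove_file` and `find_example` are hypothetical names, not the functions in dup.cpp.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Hypothetical in-memory index: MD5 hex digest -> file name.
using Index = std::multimap<std::string, std::string>;

// find: every indexed file whose digest equals the given digest.
std::vector<std::string> find_duplicates(const Index& index,
                                         const std::string& digest) {
    std::vector<std::string> names;
    const auto range = index.equal_range(digest);
    for (auto it = range.first; it != range.second; ++it) {
        names.push_back(it->second);
    }
    return names;
}

// remove: drop every entry recorded under the given file name.
// Returns the number of entries removed.
std::size_t remove_file(Index& index, const std::string& name) {
    std::size_t removed = 0;
    for (auto it = index.begin(); it != index.end();) {
        if (it->second == name) {
            it = index.erase(it);  // erase returns the next valid iterator
            ++removed;
        } else {
            ++it;
        }
    }
    return removed;
}

// Usage sketch: dup.dup is a copy of dup.cpp, so its digest matches.
void find_example() {
    Index index;
    index.emplace("13fe625700d47a6f9ab20a47de5a22ea", "dup.cpp");
    assert(find_duplicates(index, "13fe625700d47a6f9ab20a47de5a22ea").size() == 1);
}
```

Removal by name has to scan the whole index because the map is keyed by digest; for an index that is reloaded on every call this is unlikely to dominate the cost.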

Copyright ©1996-2006 Steven Blackburn