Mail Deduplicate

Command-line tool to deduplicate mails from a set of mail boxes.



  • Duplicate detection based on cherry-picked and normalized mail headers.

  • Read and deduplicate mails from multiple sources.

  • Reads and writes to mbox, maildir, babyl, mh and mmdf formats.

  • Multiple duplicate selection strategies based on size, content, timestamp, file path or random choice.

  • Copy, move or delete the resulting set of mails after the deduplication.

  • Dry-run mode.

  • Protection against false positives by checking for size and content differences.
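
To illustrate the idea behind header-based duplicate detection, here is a minimal Python sketch. It is not the tool's actual implementation: the header list, the normalization rule (collapsing whitespace), and the function names `header_fingerprint`, `group_duplicates` and `keep_largest` are all assumptions made for the example. It hashes a few normalized headers, groups mails that share the same hash, and applies one possible size-based selection strategy.

```python
import hashlib
from collections import defaultdict
from email import message_from_string

# Hypothetical header selection; the tool's real list may differ.
HEADERS = ("date", "from", "to", "subject", "message-id")


def header_fingerprint(msg):
    """Hash a few whitespace-normalized headers of an email message."""
    parts = []
    for name in HEADERS:
        value = msg.get(name, "")
        # Collapse runs of whitespace, including folded header lines.
        parts.append(name + ":" + " ".join(str(value).split()))
    return hashlib.sha224("\n".join(parts).encode("utf-8")).hexdigest()


def group_duplicates(messages):
    """Return the groups of messages that share the same fingerprint."""
    groups = defaultdict(list)
    for msg in messages:
        groups[header_fingerprint(msg)].append(msg)
    return [group for group in groups.values() if len(group) > 1]


def keep_largest(group):
    """One possible selection strategy: keep the largest mail of a group."""
    return max(group, key=lambda m: len(m.as_bytes()))
```

Messages could come from any of Python's standard `mailbox` classes (for example `mailbox.mbox` or `mailbox.Maildir`), which yield message objects when iterated. A real run would also compare sizes and bodies inside each group before acting on it, as the last feature above describes.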



This package is available on PyPI, so you can install the latest stable release and its dependencies with a simple pip call:

$ pip install mail-deduplicate


Docs are hosted on Read the Docs.

