[pm-h] maildir - remove duplicate messages

Russell L. Harris rlharris at oplink.net
Sun Mar 28 03:28:29 PDT 2010


I have on the order of 10 Gb of mail files.

Most of the files are in maildir format; a few are in mbox format.

The system is Debian GNU/Linux.

I would like to eliminate duplicate messages.  There appear to be, on 
the average, perhaps four or five copies of each message.

I also would like to sort the messages on the To: and From: fields, 
saving only certain matches.

I have been searching with Google for "maildir delete duplicate perl", 
but I have not yet found a script which looks promising.

Is there a good standard approach, script, or application for this problem?

RLH


More information about the Houston mailing list