[pm-h] maildir - remove duplicate messages

Sean Richards seangr at gmail.com
Sun Mar 28 22:09:54 PDT 2010


Its probably obvious, but there are utilities to convert mbox to maildir, so
at least you can do that with the black sheep files so you only have to
compare messages in one format.

On Sun, Mar 28, 2010 at 5:28 AM, Russell L. Harris <rlharris at oplink.net>wrote:

> I have on the order of 10 Gb of mail files.
>
> Most of the files are in maildir format; a few are in mbox format.
>
> The system is Debian GNU/Linux.
>
> I would like to eliminate duplicate messages.  There appear to be, on the
> average, perhaps four or five copies of each message.
>
> I also would like to sort the messages on the To: and From: fields, saving
> only certain matches.
>
> I have been searching with Google for "maildir delete duplicate perl", but
> I have not yet found a script which looks promising.
>
> Is there a good standard approach, script, or application for this problem?
>
> RLH
> _______________________________________________
> Houston mailing list
> Houston at pm.org
> http://mail.pm.org/mailman/listinfo/houston
> Website: http://houston.pm.org/
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.pm.org/mailman/private/houston/attachments/20100329/94c5f0d8/attachment.html>


More information about the Houston mailing list