chopping up mbox

Michael Lush mjlush at ebi.ac.uk
Mon Oct 29 08:36:34 PDT 2007


I have about 10GB of mailboxes that I'd like to break down into individual
emails so I can dump them into a simple searchable table.  It would be
quite nice (but by no means critical) to extract the 'From', 'To', dates,
attachments, etc.
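
As a rough illustration of the splitting step described above, here is a
minimal sketch. It uses Python's standard-library mailbox module purely as
an example; it is not one of the CPAN modules mentioned in this post. The
function name summarize_mbox and the dictionary keys are hypothetical, and
the code assumes reasonably well-formed mbox input, so it makes no attempt
to repair badly delimited messages.

```python
import mailbox


def summarize_mbox(path):
    """Yield one dict of basic header fields per message in the mbox at `path`.

    Illustrative sketch only: the function name and the dict keys are
    made up for this example.
    """
    # create=False makes a missing file an error instead of creating it.
    box = mailbox.mbox(path, create=False)
    try:
        for msg in box:
            # walk() visits the message and every MIME part beneath it;
            # get_filename() returns None for parts that carry no filename.
            attachments = [
                part.get_filename()
                for part in msg.walk()
                if part.get_filename()
            ]
            yield {
                # str() flattens header objects to text; absent headers
                # become empty strings.
                "from": str(msg.get("From", "")),
                "to": str(msg.get("To", "")),
                "date": str(msg.get("Date", "")),
                "subject": str(msg.get("Subject", "")),
                "attachments": attachments,
            }
    finally:
        box.close()
```

Each yielded dict could then be written as one row of the searchable table.
The generator hands back one message at a time rather than building a list,
which seems the safer choice for mailboxes of this size.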

We have had problems with these files when trying to import them
into SquirrelMail (truncations and messages running together, mostly
due to non-standard spam formatting).

I had a quick peek at CPAN and found (inevitably) there's more than one
way to do it.  Mail::Mbox::MessageParser appears to be the simple option,
while Mail::Box::Mbox seems to have all the parsing bells and whistles.

Are there any other packages I should consider?

--
Michael
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Michael John Lush PhD			Tel:44-1223 492626
Bioinformatician 
HUGO Gene Nomenclature Committee	Email: hgnc at genenames.org
European Bioinformatics Institute
Hinxton, Cambridge
URL: http://www.genenames.org
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~



More information about the MiltonKeynes-pm mailing list