[Pdx-pm] A call for XML documents

Erik Hollensbe erik at hollensbe.org
Sun Nov 29 08:02:09 PST 2009


On 11/29/2009 10:58 AM, Tyler Riddle wrote:
> Hello fellow mongers,
>
> I'm putting together a software package to unmarshal XML documents as
> fast as possible and I'm currently trying to bang out the API. My use
> case so far is the MediaWiki dump file format but I'd like to see
> other real world XML document examples to throw at my API and see how
> it stands up. Anyone have some XML documents they deal with or know
> where I can find a repository of them? Links would be appreciated!
>
> Thanks,
>
> Tyler Riddle
>

Python's "feedparser" project had a ton of test documents that were very
handy when I wrote an Atom parser. Maybe you'll find them useful.

Also, there is a standard battery of XML test documents, but I cannot
remember its name; I imagine the libxml project would know it, though.

-Erik

