[Chicago-talk] parsing HTML

Jay Strauss me at heyjay.com
Fri Feb 23 13:18:21 PST 2007


Hi,

I need to parse out the text from HTML like:

<SPAN class="main-body"><B>Street Address</B></SPAN>

to pluck out "Street Address"

or

<SPAN class="main-body">
                                <span id="UcGeoResult11_lbZipCode"><font color="Navy">60643</font></span></SPAN>

to pluck out "60643"

Would you suggest using a regex (I can't get mine to work) or a
module such as HTML::Parser?

Thanks
Jay

