SPUG: XPath on (less-than-perfect) HTML

Michael R. Wolf MichaelRWolf at att.net
Tue Dec 8 23:59:58 PST 2009


On Dec 8, 2009, at 7:44 PM, Yitzchak Scott-Thoennes wrote:

> On Tue, Dec 8, 2009 at 5:43 PM, Michael R. Wolf  
> <MichaelRWolf at att.net> wrote:

[...]


>> Could the only difference be that I've got to be explicit with the  
>> XML::LibXML parser about recovering on non-well-formed input, while  
>> the HTML one already (tacitly) expects non-well-formed input?
>
> No personal experience, but it's not just about recovering; it's about
> recovering the way a browser would have interpreted the HTML.

Good point.  Thanks.  Do you know if it has a "quirks mode"?
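
For illustration, here is a minimal sketch of what "being explicit about recovering" might look like with XML::LibXML's HTML parser. The sample markup and variable names are made up for this example. The recover and suppress_errors options are the documented parser options, but how closely the resulting tree matches what a browser would build is exactly the open question above, so treat this as a starting point rather than an answer.

```perl
use strict;
use warnings;
use XML::LibXML;

# Hypothetical sample input: unclosed <p> and <li> tags, an unquoted
# attribute value, and a stray closing </span>.
my $html = <<'HTML';
<html><body>
<p>First paragraph
<ul><li><a href=one.html>One</a><li><a href="two.html">Two</a></span></ul>
</body></html>
HTML

# recover => 1 asks libxml2 to keep parsing after errors;
# suppress_errors => 1 keeps the parser's complaints off STDERR.
my $doc = XML::LibXML->load_html(
    string          => $html,
    recover         => 1,
    suppress_errors => 1,
);

# Ordinary XPath against the recovered tree.
my @hrefs = map { $_->value } $doc->findnodes('//a/@href');
print "$_\n" for @hrefs;
```

HTML::TreeBuilder::XPath would be the other route to the same XPath query; the sketch above only shows the XML::LibXML side.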


> From the TreeBuilder POD:
>

[...]

>> -- because,
>> as Roseanne Roseannadanna once said, "it's always something".


Or, as my Dad said: if it's not one thing, it's ... 10!


-- 
Michael R. Wolf
     All mammals learn by playing!
         MichaelRWolf at att.net





