[Edinburgh-pm] Marpa::HTML

Miles Gould miles at assyrian.org.uk
Fri May 20 03:35:48 PDT 2011


There's been some chat on the lists about parsing real-world broken
HTML in Perl. I noticed this

http://search.cpan.org/dist/Marpa-HTML/lib/Marpa/HTML/Doc/HTML.pod

and wondered if anyone might be interested, or perhaps had used it?
The author claims that his underlying parsing engine
(http://search.cpan.org/~jkegl/Marpa-0.204000/lib/Marpa/Doc/Marpa.pod)
can parse any context-free grammar in O(n^3) time and any LL/LR/etc
grammar in O(n) time, and has nice error reporting. On the other hand,
he says it's alpha software (and Marpa::XS, released only a month ago,
is presumably doubly alpha).

There's more on Marpa and the author's mission to kill Yacc at his
blog, http://blogs.perl.org/users/jeffrey_kegler/.

Miles.


More information about the Edinburgh-pm mailing list