[tpm] I wish I was better at regex's
Uri Guttman
uri at StemSystems.com
Wed Mar 9 12:36:13 PST 2011
>>>>> "RD" == Richard Dice <richard.dice at gmail.com> writes:
RD> Those are getting closer, but honestly this can go way beyond the
RD> realm of what regexps can provide. For example, the text in the
RD> comment area... In theory, it could contain text like
RD> "key"="Val\"ue" ; tricked you!!
RD> To fully, correctly solve this you need a full parser. Like
RD> Parse::RecDescent
RD> Which is like hitting a melon with an atomic bomb.
you don't need a full parser. this can be done with Text::Balanced from
the looks of it. getting a regex to do it is insane though. just scan
from the left for each possible field. get the key=value with possible
quoted values (i don't have the spec in my head), then check for a
comment and skip/delete it. the module will handle escaped quotes and
almost anything else you could legally put there. this is easy when you
scan it properly and hard with a single regex.
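
a minimal sketch of that left-to-right scan, under stated assumptions: each
line looks like "key"="value" with an optional trailing ; comment (guessed
from Richard's example, not taken from any spec), and parse_line is a
hypothetical helper name. extract_delimited is the Text::Balanced function
that pulls one quoted string off the front of the text and honours
backslash-escaped quotes by default.

```perl
use strict;
use warnings;
use Text::Balanced qw(extract_delimited);

# hypothetical helper: parse one line of the ASSUMED form
#   "key"="value" ; optional comment
# returns (key, value), or an empty list if the line does not fit.
sub parse_line {
    my ($line) = @_;

    # pull the quoted key off the left end (leading whitespace is skipped)
    my ($key, $rest) = extract_delimited($line, '"');
    return unless defined $key && length $key;

    # an '=' must separate key and value
    $rest =~ s/\A\s*=\s*// or return;

    # pull the quoted value; an escaped \" inside it does not end the string
    my ($value, $tail) = extract_delimited($rest, '"');
    return unless defined $value && length $value;

    # whatever is left must be blank or a ';' comment, which is dropped
    return unless $tail =~ /\A\s*(?:;.*)?\z/s;

    # strip the surrounding quotes and undo backslash escapes
    for ($key, $value) {
        s/\A"//;
        s/"\z//;
        s/\\(.)/$1/gs;
    }
    return ($key, $value);
}

my ($k, $v) = parse_line('"key"="Val\"ue" ; tricked you!!');
print "$k => $v\n";    # key => Val"ue
```

because the comment check only runs on what is left after the value has been
extracted, a ';' inside the quotes stays part of the value.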
uri