[Omaha.pm] Fwd: the joys of encoding

Jay Hannah jhannah at mutationgrid.com
Wed Dec 29 11:24:41 PST 2010


Hmm... Sean: Looks like this is a 5.10.0 bug that people prepped patches for in 2008/2009 but I can't tell based on that thread if the patches got applied or not.

http://groups.google.com/group/perl.perl5.porters/browse_thread/thread/94914fe2d844f5ec/2860ab39fbae7c9c?lnk=raot&pli=1

You might check the little program George Greer posted on Apr 19 2009 and see if that fails on your perl but works on more recent ones. And try isolating your real-data test too...?

HTH,

Jay Hannah
Software Architect
jhannah at mutationgrid.com | http://mutationgrid.com | 1-402-598-7782



> On Dec 28, 2010, at 4:54 PM, Sean Baker wrote:
>> This seems to work in Perl 5.8 and dies in Perl 5.10:
>> 
>>  DB<2> x encode_entities($str, "\200-\377")
>> Assertion rx->sublen >= (s - rx->subbeg) + i failed: file "regcomp.c", line 5109 at (eval 4652)[/usr/lib/perl5/site_perl/5.10.0/x86_64-linux-thread-multi/HTML/Entities.pm:456] line 1.
>>  DB<3> x $str
>> 0         '<b>Ce forfait inclut:</b><br>Chambre Luxueuse, petit déjeuner buffet par jour, par personne, sac \x{201C}Omni\x{201D} pour les enfants, lait & biscuits (soir de l\x{2019}arrivée).'
> 
> Is HTML::Entities current (3.68)?
> 
> Can this go on Omaha.pm?


From: "Sean Baker"
Date: December 29, 2010 8:00:15 AM CST
To: "Jay Hannah" <jhannah at mutationgrid.com>
> Sure.  Yes, it's on a current version.  I'm going to go through production packages and compare against QA & dev and roll anything up or back that I need too.
> 
> Module id = HTML::Entities
>    DESCRIPTION  Encode/decode HTML entities
>    CPAN_USERID  LWWWP (The libwww-perl mailing list <libwww at perl.org>)
>    CPAN_VERSION 3.68
>    CPAN_FILE    G/GA/GAAS/HTML-Parser-3.68.tar.gz
>    UPLOAD_DATE  2010-09-01
>    DSLIP_STATUS Rmpf? (released,mailing-list,perl,functions,)
>    MANPAGE      HTML::Entities - Encode or decode strings with HTML entities
>    INST_FILE    /usr/lib/perl5/site_perl/5.10.0/x86_64-linux-thread-multi/HTML/Entities.pm
>    INST_VERSION 3.68





More information about the Omaha-pm mailing list