[Chicago-talk] Perl script run slower for utf8

Andrew Rodland andrew at cleverdomain.org
Thu Jan 26 17:01:13 PST 2012


Sometimes you can either do the wrong thing quickly, or do the right thing
slowly. This is one of those times. Unicode support slows down a lot of
matching operations because character class matching isn't just a matter of
looking at bits in 256-entry bitmaps anymore.

I would, however, check whether you have Text::CSV_XS installed, as it's
faster than the pure-perl Text::CSV, and its speed is probably
less-affected by Unicode.

On Thu, Jan 26, 2012 at 3:58 PM, tiger peng <tigerpeng2001 at yahoo.com> wrote:

> Hello all,
>
> I just made a Perl script for parsing large CSV files (with Text::CSV).
> When I enable Unicode (with the three use uncommented), it took as twice
> longer time. Is it normal? Is there any way to speed it up?
>
> Thanks,
>
> #use utf8;
> #use encoding "utf-8";
> #use open ':encoding(utf8)';
>
> _______________________________________________
> Chicago-talk mailing list
> Chicago-talk at pm.org
> http://mail.pm.org/mailman/listinfo/chicago-talk
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.pm.org/pipermail/chicago-talk/attachments/20120126/83ae5153/attachment.html>


More information about the Chicago-talk mailing list