From karjala at karjala.org Sun Jul 27 03:40:37 2003 From: karjala at karjala.org (Alexander Karelas) Date: Mon Aug 2 21:22:59 2004 Subject: [Athens-pm] Unicode to Greek Message-ID: <20030727084037.GA22256@karjala.org> I'm trying to change the unicode file here [1] to the standard iso-8859-7 character set, in order to process it with Perl afterwards. [1] http://www.ypan.gr/flash_fuel/ath/ath.dat After having installed the module Unicode::MapUTF8, which also installs the shell utility 'umap', I did: $ umap utf16:greek output For some reason the translation stops after a couple of hundred of characters, and the output file is a lot smaller than the original. Does anyone have a clue as to what's wrong there? Thx, - Alexander P.S. I can't find a perl function to do the mapping from utf16... while there is a from_utf8 in Unicode::MapUTF8, from_utf16 seems like it's missing. Any help appreciated Thx From karjala at karjala.org Sun Jul 27 03:28:19 2003 From: karjala at karjala.org (Alexander Karelas) Date: Mon Aug 2 21:22:59 2004 Subject: [Athens-pm] Unicode to Greek In-Reply-To: <20030727084037.GA22256@karjala.org> References: <20030727084037.GA22256@karjala.org> Message-ID: <3F238D23.8040105@karjala.org> Just found a solution... use the gnu utility 'recode' as a filter: "cat | recode UTF-16..ISO-8859-7" Not very perlish, but it's okay... Alex Alexander Karelas wrote: >I'm trying to change the unicode file here [1] to the standard >iso-8859-7 character set, in order to process it with Perl afterwards. > >[1] http://www.ypan.gr/flash_fuel/ath/ath.dat > >After having installed the module Unicode::MapUTF8, which also installs >the shell utility 'umap', I did: >$ umap utf16:greek output > >For some reason the translation stops after a couple of hundred of >characters, and the output file is a lot smaller than the original. > >Does anyone have a clue as to what's wrong there? > >Thx, >- Alexander > >P.S. I can't find a perl function to do the mapping from utf16... while >there is a from_utf8 in Unicode::MapUTF8, from_utf16 seems like it's >missing. > >Any help appreciated > >Thx >_______________________________________________ >Athens-pm mailing list >Athens-pm@mail.pm.org >http://mail.pm.org/mailman/listinfo/athens-pm > > >