[Kc] cleaning up unicode?
Garrett Goebel
garrett at scriptpro.com
Tue Mar 4 06:31:53 CST 2003
John Reinke wrote:
>
> On Mon, 2003-03-03 at 09:15, Garrett Goebel wrote:
> > > I have a text file which contains a character I am
> > > guessing as being considered a unicode character.
> > > It is the letter 'u' with the accent mark over it
> > > that looks like an apostrophe, which appears in
> > > some text editors as <FA>.
[...]
> > > I'd like to convert any of those characters to be
> > > regular ASCII characters, most likely with a tr
> > > command, but I haven't been able to find a way to
> > > match that character. Any suggestions?
>
> > It'd be nice to know what you're actually trying to
> > accomplish.
>
> What I'm trying to accomplish is remove the accent marks
> from characters, essentially reducing everything down to
> 7-bit ASCII. Since you're asking, the strings will become
> file names. I want to create a subroutine that will
> convert a string to something valid for my file system.
What file system(s) does the filename need to be valid for?
--
Garrett Goebel
IS Development Specialist
ScriptPro Direct: 913.403.5261
5828 Reeds Road Main: 913.384.1008
Mission, KS 66202 Fax: 913.384.2180
www.scriptpro.com garrett at scriptpro.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.pm.org/pipermail/kc/attachments/20030304/d30b6a96/attachment.htm
More information about the kc
mailing list