[Kc] cleaning up unicode?

Garrett Goebel garrett at scriptpro.com
Tue Mar 4 06:31:53 CST 2003


John Reinke wrote:
> 
> On Mon, 2003-03-03 at 09:15, Garrett Goebel wrote:
> > > I have a text file which contains a character I am 
> > > guessing as being considered a unicode character.
> > > It is the letter 'u' with the accent mark over it
> > > that looks like an apostrophe, which appears in
> > > some text editors as <FA>.
[...] 
> > > I'd like to convert any of those characters to be
> > > regular ASCII characters, most likely with a tr
> > > command, but I haven't been able to find a way to
> > > match that character. Any suggestions?
>  
> > It'd be nice to know what you're actually trying to 
> > accomplish. 
> 
> What I'm trying to accomplish is remove the accent marks
> from characters, essentially reducing everything down to
> 7-bit ASCII. Since you're asking, the strings will become
> file names. I want to create a subroutine that will
> convert a string to something valid for my file system.

What file system(s) does the filename need to be valid for?

--
Garrett Goebel
IS Development Specialist

ScriptPro                   Direct: 913.403.5261
5828 Reeds Road               Main: 913.384.1008
Mission, KS 66202              Fax: 913.384.2180
www.scriptpro.com          garrett at scriptpro.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.pm.org/pipermail/kc/attachments/20030304/d30b6a96/attachment.htm


More information about the kc mailing list