[Kc] cleaning up unicode?
Scott Kahler
scottk at uclick.com
Tue Mar 4 06:38:33 CST 2003
I haven't seen any package that does what you are looking for, John. I've
run into a bit of a problem with this recently as I've been working with
high-value ASCII too. About the only thing I think you'd be able to do
is build a list of one-to-one conversions. The problem I've run into with
this is that Unix, Windows and Mac all have different character maps for
characters outside our standard alphanumerics.
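
A list of one-to-one conversions along those lines might look roughly like
the Python sketch below. The table entries and the name to_ascii are made up
for illustration and do not come from any existing package. The sketch also
assumes the input has already been decoded to Unicode, which sidesteps the
per-platform character map differences rather than solving them.

```python
# Hypothetical hand-built table: each accented character maps to a plain
# ASCII replacement. Only a few entries are shown; a real list would be longer.
CONVERSIONS = {
    "á": "a", "à": "a", "â": "a", "ä": "a",
    "é": "e", "è": "e", "ê": "e", "ë": "e",
    "í": "i", "ó": "o", "ö": "o", "ú": "u", "ü": "u",
    "ñ": "n", "ç": "c",
    "Ä": "A", "Ö": "O", "Ü": "U",
    "ß": "ss",  # a replacement may be longer than one character
}

# str.maketrans accepts a dict of single characters to replacement strings.
_TABLE = str.maketrans(CONVERSIONS)


def to_ascii(text):
    """Replace every character found in CONVERSIONS; leave the rest alone."""
    return text.translate(_TABLE)
```

Anything missing from the table passes through unchanged, so the result is
only 7-bit clean if the list covers every character that actually shows up.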
John Reinke wrote:
>
>What I'm trying to accomplish is to remove the accent marks from
>characters, essentially reducing everything down to 7-bit ASCII. Since
>you're asking, the strings will become file names. I want to create a
>subroutine that will convert a string to something valid for my file
>system. While I could just eliminate the accented characters, it would
>make sense to retain the letter part, and eliminate the additional
>punctuation - no offense intended toward your "beautiful" (auf Deutsch)
>example, Garrett.
>
>I thought that this might have been common enough that someone had a
>quick formula that could handle this. Perhaps not. I'll have to look for
>an existing package or code something up from scratch...
>
>Thanks,
>John
>
>
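
For the goal John describes, one possible approach (not something proposed
in this thread) is to let Unicode decomposition split each letter from its
accent mark and then throw the marks away. Below is a minimal Python sketch
of that idea. The name filename_safe and the set of characters allowed in a
file name are assumptions for illustration, so adjust them for your own file
system.

```python
import re
import unicodedata


def filename_safe(text):
    """Hypothetical helper: strip accents, then keep a conservative ASCII subset."""
    # NFKD splits a character such as "é" into "e" plus a combining accent.
    decomposed = unicodedata.normalize("NFKD", text)
    # Drop the combining marks, keeping the base letters.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Discard whatever is still outside 7-bit ASCII (for example "ß",
    # which has no decomposition).
    ascii_only = stripped.encode("ascii", "ignore").decode("ascii")
    # Collapse runs of characters that are awkward in file names into "_".
    return re.sub(r"[^A-Za-z0-9._-]+", "_", ascii_only).strip("_")
```

Letters that do not decompose into a base letter plus a mark are silently
dropped here, so a short hand-written conversion list for those cases may
still be needed.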
--
Scott Kahler
=-=-=-=-=-=-=-=-=-=-=-=-=
DB Hacker
http://www.uclick.com
816-210-8884
scottk at uclick.com
=-=-=-=-=-=-=-=-=-=-=-=-=
Brain: Gone at last, obsequious buffoons.
Pinky: Right-O Brain. Narf! Obsequious!
Brain: Pinky, do you have any idea what obsequious means?
Pinky: No, but it sounds squishy! Ooo I love squishy!