[San-Diego-pm] accents

Joel Fentin joel at fentin.com
Tue Oct 26 23:02:26 CDT 2004

Douglas Wilson wrote:

> Joel Fentin wrote:
>> I need to see if what the Spanish language operator enters is 
>> contained in a long hunk of text. Something like this:
>> if($X =~ /$Y/){[Do something]}
>> The operator might enter josé, JOSÉ, or jose. He might enter niño, 
>> NIÑO, or nino.
>> An i modifier to m// will take care of case. Is there any fell swoop 
>> way of taking care of accents?
> Those characters all have the high order bit on, so here is a crude
> way:
> my $str = "josé, JOSÉ, or jose. He might enter niño, NIÑO";
> my @funny_chars = $str =~ /([^\x00-\x7F])/g;
> print "@funny_chars\n";


Although what you say is interesting, it gets me no closer to my 
solution. Knocking the high order bit from ñ does not turn it into n. 
The one is a binary 11110001 and the other is 01101110.

In order to do a language neutral look into $Y to see if $X is within, I 
will probably have to knock the accents off all such characters in both 
$X & $Y. What I am hoping is that Perl has a fell-swoop method for this.

I looked in my books, and on the Internet, but much of what I saw I 
couldn't understand (Unicode) and none of it seemed on target.
Joel Fentin    tel: 760-749-8863    FAX: 760-749-8864
Contact me:    http://fentin.com/me/ContactMe.html
Biz:           http://fentin.com
Personal:      http://fentin.com/me/

More information about the San-Diego-pm mailing list