[Chicago-talk] Malformed UTF-8 character

Andy_Bach at wiwb.uscourts.gov
Mon Dec 18 12:32:11 PST 2006


> I now use something similar to `iconv -c -t UTF-8 < inputfile` to clean out
> any UTF-8 badness.  The -c option does that for you.


Yeah, but it's not so much that I want to remove them as that I want to
*find* them, since they're a ... call it a syntax error, in the input data.
I was trying to s/// them into something markable and return that so folks
could clean up their input.  Which I can do, I just get those warnings ...
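
To make the "find and mark" idea concrete, here is a rough sketch, in
Python rather than Perl, of one way it could work: read the input as raw
bytes, try a strict UTF-8 decode per line, and on failure re-decode with an
error handler that substitutes a visible marker for each bad byte.  The
`<BAD:XX>` marker format, the handler name `mark_bad_utf8`, and both
function names are made up for illustration; this is not the code from the
thread.

```python
import codecs


def _mark_bad_bytes(err):
    """Error handler: replace each undecodable byte with a visible marker."""
    if not isinstance(err, UnicodeDecodeError):
        raise err
    bad = err.object[err.start:err.end]          # the offending byte run
    marker = "".join("<BAD:%02X>" % b for b in bad)
    return marker, err.end                       # resume decoding after it


# Hypothetical handler name, registered once at import time.
codecs.register_error("mark_bad_utf8", _mark_bad_bytes)


def mark_malformed_utf8(raw):
    """Decode bytes as UTF-8, marking malformed bytes instead of dropping them."""
    return raw.decode("utf-8", errors="mark_bad_utf8")


def find_malformed_lines(raw):
    """Return (line_number, marked_text) for each line with invalid UTF-8."""
    hits = []
    for lineno, line in enumerate(raw.split(b"\n"), start=1):
        try:
            line.decode("utf-8")                 # strict check, raises on bad input
        except UnicodeDecodeError:
            hits.append((lineno, mark_malformed_utf8(line)))
    return hits


if __name__ == "__main__":
    # Line 2 holds a lone Latin-1 byte (0xE9); line 3 is valid UTF-8.
    sample = b"ok line\ncaf\xe9 au lait\nna\xc3\xafve\n"
    for lineno, text in find_malformed_lines(sample):
        print("line %d: %s" % (lineno, text))
```

Because the data stays as bytes until the explicit decode, nothing is
silently dropped the way `iconv -c` drops it, and the report gives people a
line number plus the exact byte to go fix.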

a

Andy Bach
Systems Mangler
Internet: andy_bach at wiwb.uscourts.gov
VOICE: (608) 261-5738  FAX 264-5932

Seville Dar Daigo
Tousin Busses Inaro
Nojo Demistrux
Summit Cows In
Summit Dux 

