Thanks for your response Bruce. I neglected to mention I was calling into a
program that I couldn't see under the covers and it appears as though the
iconv is working as expected (I wrote a test program) and converts to UTF-8
just fine, but the program I was calling into wasn't converting from 37 to
1208 as I had expected (longer story that I wont digress into).
BTW, for anybody else looking for some good documentation on some of the
things I have hit on in the past week researching UTF-8 and CCSID's see the
following:
http://en.wikipedia.org/wiki/CCSID
http://en.wikipedia.org/wiki/UTF-8
http://www.tachyonsoft.com/iso88591.htm
http://www.tachyonsoft.com/uc0000.htm // This link does a good job of
addressing my original question
http://www-306.ibm.com/software/globalization/ccsid/ccsid819.jsp
http://www-306.ibm.com/software/globalization/ccsid/ccsid1208.jsp
http://publib.boulder.ibm.com/infocenter/iseries/v5r3/index.jsp?topic=/rzaha
/fileenc.htm
http://www.cl.cam.ac.uk/~mgk25/unicode.html
http://tools.ietf.org/html/rfc3629
http://czyborra.com/utf/
Thanks to all who responded.
Aaron Bartell
http://mowyourlawn.com
-----Original Message-----
From: rpg400-l-bounces@xxxxxxxxxxxx [
mailto:rpg400-l-bounces@xxxxxxxxxxxx]
On Behalf Of Bruce Vining
Sent: Thursday, July 05, 2007 9:44 AM
To: RPG programming on the AS400 / iSeries
Subject: Re: CCSID conversion tables
Actually Ü is x'C39C' in UTF-8. Is it possible that you're converting
inplace and that insufficient space has been allocated for iconv to return
the two byte result in? This might cause you to think iconv is returning
x'DC' when in reality it's simply leaving your input data as is.
When working with UTF8 you need to remember that anything greater than 7-bit
ASCII will convert into a multi-byte encoding.
Bruce Vining
"albartell" <albartell@xxxxxxxxx>
Sent by: rpg400-l-bounces@xxxxxxxxxxxx
07/05/2007 08:53 AM
Please respond to
RPG programming on the AS400 / iSeries <rpg400-l@xxxxxxxxxxxx>
To
"'RPG programming on the AS400 / iSeries'" <rpg400-l@xxxxxxxxxxxx>
cc
Subject
CCSID conversion tables
Hi all,
I can't believe I haven't come across a need to see translation tables
from
one CCSID to the next, but today I need it :-) Traditionally, or at least
before I started using iconv and when I used QDCXLATE, I would simply do a
WRKTBL and select the appropriate table. I have a latin capital letter U
with diaeresis that is causing me problems when converting to UTF-8 from
ASCII ISO88591. Does anyone know if IBM has a translation site for
CCSID's
similar to this one I found?
<
http://www.tachyonsoft.com/uc0000.htm>
http://www.tachyonsoft.com/uc0000.htm
I am using iconv to convert to CCSID 1208 (UTF-8) and the U with diaeresis
*should* be converting to x'C3' (as far as I can tell) but is remaining at
a
x'DC'.
TIA,
Aaron Bartell
<
http://mowyourlawn.com>
http://mowyourlawn.com
As an Amazon Associate we earn from qualifying purchases.