× The internal search function is temporarily non-functional. The current search engine is no longer viable and we are researching alternatives.
As a stop gap measure, we are using Google's custom search engine service.
If you know of an easy to use, open source, search engine ... please contact support@midrange.com.



Thanks for your response Bruce. I neglected to mention I was calling into a
program that I couldn't see under the covers and it appears as though the
iconv is working as expected (I wrote a test program) and converts to UTF-8
just fine, but the program I was calling into wasn't converting from 37 to
1208 as I had expected (longer story that I wont digress into).


BTW, for anybody else looking for some good documentation on some of the
things I have hit on in the past week researching UTF-8 and CCSID's see the
following:
http://en.wikipedia.org/wiki/CCSID
http://en.wikipedia.org/wiki/UTF-8
http://www.tachyonsoft.com/iso88591.htm
http://www.tachyonsoft.com/uc0000.htm // This link does a good job of
addressing my original question
http://www-306.ibm.com/software/globalization/ccsid/ccsid819.jsp
http://www-306.ibm.com/software/globalization/ccsid/ccsid1208.jsp
http://publib.boulder.ibm.com/infocenter/iseries/v5r3/index.jsp?topic=/rzaha
/fileenc.htm
http://www.cl.cam.ac.uk/~mgk25/unicode.html
http://tools.ietf.org/html/rfc3629
http://czyborra.com/utf/

Thanks to all who responded.
Aaron Bartell
http://mowyourlawn.com

-----Original Message-----
From: rpg400-l-bounces@xxxxxxxxxxxx [mailto:rpg400-l-bounces@xxxxxxxxxxxx]
On Behalf Of Bruce Vining
Sent: Thursday, July 05, 2007 9:44 AM
To: RPG programming on the AS400 / iSeries
Subject: Re: CCSID conversion tables

Actually Ü is x'C39C' in UTF-8. Is it possible that you're converting
inplace and that insufficient space has been allocated for iconv to return
the two byte result in? This might cause you to think iconv is returning
x'DC' when in reality it's simply leaving your input data as is.

When working with UTF8 you need to remember that anything greater than 7-bit
ASCII will convert into a multi-byte encoding.

Bruce Vining




"albartell" <albartell@xxxxxxxxx>
Sent by: rpg400-l-bounces@xxxxxxxxxxxx
07/05/2007 08:53 AM
Please respond to
RPG programming on the AS400 / iSeries <rpg400-l@xxxxxxxxxxxx>


To
"'RPG programming on the AS400 / iSeries'" <rpg400-l@xxxxxxxxxxxx>
cc

Subject
CCSID conversion tables






Hi all,

I can't believe I haven't come across a need to see translation tables
from
one CCSID to the next, but today I need it :-) Traditionally, or at least
before I started using iconv and when I used QDCXLATE, I would simply do a
WRKTBL and select the appropriate table. I have a latin capital letter U
with diaeresis that is causing me problems when converting to UTF-8 from
ASCII ISO88591. Does anyone know if IBM has a translation site for
CCSID's
similar to this one I found?

<http://www.tachyonsoft.com/uc0000.htm>
http://www.tachyonsoft.com/uc0000.htm

I am using iconv to convert to CCSID 1208 (UTF-8) and the U with diaeresis
*should* be converting to x'C3' (as far as I can tell) but is remaining at
a
x'DC'.

TIA,
Aaron Bartell
<http://mowyourlawn.com> http://mowyourlawn.com

As an Amazon Associate we earn from qualifying purchases.

This thread ...

Replies:

Follow On AppleNews
Return to Archive home page | Return to MIDRANGE.COM home page

This mailing list archive is Copyright 1997-2024 by midrange.com and David Gibbs as a compilation work. Use of the archive is restricted to research of a business or technical nature. Any other uses are prohibited. Full details are available on our policy page. If you have questions about this, please contact [javascript protected email address].

Operating expenses for this site are earned using the Amazon Associate program and Google Adsense.