|
Hi Eiichi, I wanted to thank you for your messages. It is always very helpful for a native speaker to contibute with regard to other languages. Although I have a master's degree in international management and have written i18n tutorials and moderated i18n forums, my direct experience with Asian languages is limited to a few Japanese and Korean phrases ( including swearing in Korean ;-) ). As you might expect, I can also recognize the ideograph and word for '\u4E94'. At the same time, your response is somewhat bothersome. My references for the file encoding are directly from the v5r2 version of the IBM Developer Kit for Java manual. While it also shows that Cp930 is used for "IBM EBCDIC Japanese Extended Katakana", it lists it as used with CCSID 930. I think most programmers are going to look at the CCSID, which Joe noted as 943 and 5026, and check the corresponding encodings in the table. Please note that this is not to argue with you and is intended generally, for any language. My point is that I hope the responsible parties at IBM have or will take great pains to verify file encoding and CCSID matching. We developers really have few alternatives when it comes to multilingual apps. Again, thanks and I hope you and other native non-English speakers will jump in when someone like me is in need of assistance. BTW, my friends call me Joe Sam Joe Sam Shirah - http://www.conceptgo.com conceptGO - Consulting/Development/Outsourcing Java Filter Forum: http://www.ibm.com/developerworks/java/ Just the JDBC FAQs: http://www.jguru.com/faq/JDBC Going International? http://www.jguru.com/faq/I18N Que Java400? http://www.jguru.com/faq/Java400 ----- Original Message ----- From: <eiichi.yoshihira@xxxxxxxxxxxxx> To: <java400-l@xxxxxxxxxxxx> Sent: Wednesday, January 21, 2004 5:52 AM Subject: RE: DBCS to Shift-JIS > Joe and Joe,, > > If you're using CCSID 5026, the Java equivalent encoding value should be > Cp930. > CCSID hex ESID Base CP1 CP2 Java K > Description > 5026 13a2 mbcs-ebcdic 930 290 + 300 Cp930 I Japan > EBCDIC > 5035 13ab mbcs-ebcdic 939 1027 + 300 Cp939 I Japan > EBCDIC > > a side note on this, Cp943c, Cp943, MS932 and SJIS may have different > mappings to Unicode characters, so if you have any problems converting > certain Japanese characters in Java, I suggest refering to this url for > selecting the correct mapping. > http://www.ingrid.org/java/i18n/encoding/ja-conv.html (yes, its in > Japanese...) > > Regards, > Eiichi >
As an Amazon Associate we earn from qualifying purchases.
This mailing list archive is Copyright 1997-2024 by midrange.com and David Gibbs as a compilation work. Use of the archive is restricted to research of a business or technical nature. Any other uses are prohibited. Full details are available on our policy page. If you have questions about this, please contact [javascript protected email address].
Operating expenses for this site are earned using the Amazon Associate program and Google Adsense.