|
I don't know when "MBCS" was introduced, so maybe you are speaking from a context where SBCS and DBCS are the only possible choices. But UTF-8 is definitely a MBCS encoding. The part of UTF-8 that overlaps with ASCII are one-byte characters, but any given character[1] can use up to 4 bytes to encode in UTF-8.
CCSID is a special IBM construct that is a little bit more complicated than the notion of "encoding" that the rest of the world uses.
But I would say a good first step (and for many, the only step that's really needed) is to pretend that CCSIDs really are "just" encodings in the same sense that the rest of the world uses, and then learn about those encodings. And yet again, I will point to my favorite article on the topic:
https://www.joelonsoftware.com/2003/10/08/the-absolute-minimum-every-software-developer-absolutely-positively-must-know-about-unicode-and-character-sets-no-excuses/
As an Amazon Associate we earn from qualifying purchases.
This mailing list archive is Copyright 1997-2024 by midrange.com and David Gibbs as a compilation work. Use of the archive is restricted to research of a business or technical nature. Any other uses are prohibited. Full details are available on our policy page. If you have questions about this, please contact [javascript protected email address].
Operating expenses for this site are earned using the Amazon Associate program and Google Adsense.