|
Anyone confirm that some machine code stuff is done
better with little-endian?
Now it is not required to use a BOM. I suppose one can identify which
flavor of UTF-16 you have when you determine if a null is in an odd
position or an even position - even is LE, odd is BE - not sure how
anyone else, such as NotePad++, does that.
But UTF-8 is another creature - it doesn't have endian flavors but does
have a BOM, EF BB BF - it also is not required, and if it's absent, you
have a real guessing game on your hands. On the i, what I did was try to
copy the text file to one with 1208 CCSID - if successful, I considered
the contents to be UTF-8. Not great but mostly useful. There ARE certain
byte sequences that, I will say, probably can be sure to mean the
contents is UTF-8
Patrik, by the flag in metadata, do you mean the CCSID or the code page?
As an Amazon Associate we earn from qualifying purchases.
This mailing list archive is Copyright 1997-2025 by midrange.com and David Gibbs as a compilation work. Use of the archive is restricted to research of a business or technical nature. Any other uses are prohibited. Full details are available on our policy page. If you have questions about this, please contact [javascript protected email address].
Operating expenses for this site are earned using the Amazon Associate program and Google Adsense.