On Mon, Mar 4, 2013 at 2:52 PM, Vernon Hamberg <vhamberg@xxxxxxxxxxxxxxx> wrote:
If you have a file saved on a Unix box or maybe a Macintosh, it'll have
only one of these, either x0D or x0A.
Macs haven't used CR in quite some time. For all practical purposes,
there are only two EOL conventions remaining: CRLF for Windows and LF
for everything else.
So the typical UTF-16
representation for our situation is ANSI characters alternating with
x00's. We also have tab characters, which end up as x05 in the PF.
That is probably typical, but watch out for characters specific to
Windows-1252 which don't "fit" in one byte. These are the notorious
not-the-same-as-ISO-8859-1 characters in the (hex) range 80-9F. Most
notably, these include the Euro symbol and the angled ("smart")
quotes.
John