.NET Framework Standard Library Annotated Reference, Volume 1: Base Class Library and Extended Numerics Library
by Brad Abrams
Description
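System.Text.UnicodeEncoding encodes Unicode characters in UTF-16: each 16-bit code unit is stored as two consecutive bytes, and a character outside the Basic Multilingual Plane takes two such code units (a surrogate pair). Both little-endian and big-endian encodings are supported.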
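To make the two byte orders concrete, the following is a minimal sketch in Python rather than .NET. It assumes only that Python's built-in `utf-16-le` and `utf-16-be` codecs produce the same byte layout as the little-endian and big-endian forms described above; it does not call or reproduce the System.Text.UnicodeEncoding API.

```python
# Illustration only: Python's codecs stand in for the UTF-16 byte layout.
text = "A\u20ac"  # 'A' (U+0041) followed by the euro sign (U+20AC)

little = text.encode("utf-16-le")  # least significant byte of each unit first
big = text.encode("utf-16-be")     # most significant byte of each unit first

print(little.hex(" "))  # 41 00 ac 20
print(big.hex(" "))     # 00 41 20 ac

# A character outside the Basic Multilingual Plane is stored as a
# surrogate pair, so it occupies four bytes instead of two.
clef = "\U0001d11e"  # MUSICAL SYMBOL G CLEF
clef_bytes = clef.encode("utf-16-le")
print(len(clef_bytes))  # 4
```

The same two characters produce the same four bytes in both encodings; only the order within each two-byte unit differs.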
[Note: On little-endian platforms such as Intel machines, it is generally more efficient to store Unicode characters in little-endian byte order. Many other platforms, however, store Unicode characters in big-endian byte order. The byte order of a Unicode file can be identified by the presence of the byte order mark (U+FEFF), which is written as either 0xFE 0xFF (big-endian) or 0xFF 0xFE (little-endian). This System.Text.Encoding implementation can detect a byte order mark automatically and switch byte orders, based on a parameter specified in the constructor. ISO/IEC 10646 defines UCS-2 and UCS-4. UCS-4 is a four-byte (32-bit) encoding ...
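The byte order mark detection mentioned in the note can be sketched as follows. This is a hedged Python illustration, not the .NET implementation: the helper name `decode_utf16_with_bom` and its `default_big_endian` parameter are hypothetical, chosen only to mirror the idea of a constructor parameter that sets the assumed byte order.

```python
import codecs


def decode_utf16_with_bom(data: bytes, default_big_endian: bool = False) -> str:
    """Decode UTF-16 bytes, letting a leading byte order mark pick the order.

    Hypothetical helper for illustration; not part of any library.
    """
    if data.startswith(codecs.BOM_UTF16_BE):   # bytes 0xFE 0xFF
        return data[2:].decode("utf-16-be")
    if data.startswith(codecs.BOM_UTF16_LE):   # bytes 0xFF 0xFE
        return data[2:].decode("utf-16-le")
    # No mark present: fall back to the byte order the caller assumed.
    return data.decode("utf-16-be" if default_big_endian else "utf-16-le")
```

For example, `decode_utf16_with_bom(b"\xfe\xff\x00H\x00i")` and `decode_utf16_with_bom(b"\xff\xfeH\x00i\x00")` both yield the same two-character string, because the mark overrides the default byte order in each case.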