U
UCS (Universal Multiple-Octet Coded Character Set)

The core ISO/IEC 10646 multi-byte data format. Transformation formats reduce the size of the data by using single bytes for common characters. These formats are called UTF, with the 'U' standing for UCS.

UCS Transformation Format
See UTF (UCS Transformation Format): a mechanism for re-encoding UCS-2 and UCS-4 data more compactly for transfer between systems. The two-byte and four-byte UCS character representation schemes are often wasteful when relatively common characters are in use; a text file can easily shrink by as much as 75 per cent. Two variants, UTF-8 (8-bit) and UTF-16 (16-bit), offer different levels of character range support.
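
The 75 per cent figure can be checked with a short sketch. The Python below is illustrative only: the helper names encoded_sizes and saving are hypothetical, and Python's "utf-32-be" and "utf-16-be" codecs are used as stand-ins for UCS-4 and UCS-2, an assumption that holds only because the sample text stays within the Basic Multilingual Plane.

```python
# Illustrative sketch: compare the byte size of the same text in
# fixed-width UCS forms and in UTF-8.
# Assumption: "utf-32-be" and "utf-16-be" stand in for UCS-4 and UCS-2;
# this is valid here because the sample has no characters outside the
# Basic Multilingual Plane.

def encoded_sizes(text: str) -> dict[str, int]:
    """Return the byte length of text in each encoding (hypothetical helper)."""
    return {
        "UCS-4": len(text.encode("utf-32-be")),  # four bytes per character
        "UCS-2": len(text.encode("utf-16-be")),  # two bytes per character
        "UTF-8": len(text.encode("utf-8")),      # one byte per ASCII character
    }

def saving(original: int, transformed: int) -> float:
    """Fraction of the original size saved by the transformed encoding."""
    return 1 - transformed / original

sample = "Hello, world"
sizes = encoded_sizes(sample)
print(sizes)  # {'UCS-4': 48, 'UCS-2': 24, 'UTF-8': 12}
print(f"{saving(sizes['UCS-4'], sizes['UTF-8']):.0%}")  # 75%
```

The full 75 per cent saving applies only to text made up entirely of ASCII characters and stored as UCS-4. Characters outside that range take two or more bytes in UTF-8, so the saving shrinks accordingly.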
UCS Transformation Format 16 Bit Form
See UTF-16.
UCS-2

An ISO/IEC 10646 ...
