Character Entities
Characters not found in the normal alphanumeric character set,
such as < and &, must be specified in HTML and XHTML documents
using character references. This process is known as
escaping the character. In (X)HTML documents,
escaped characters are indicated by character references that begin with
& and end with ; . The character may be referred to by its
Numeric Character Reference (NCR) or a predefined character entity
name.
A Numeric Character Reference refers to a character by its Unicode
code point in either decimal or hexadecimal form. Decimal character
references use the syntax &#nnnn;
. Hexadecimal values are indicated by an “x”: &#xhhhh; . For example, the less-than
(<) character could be identified as < (decimal) or < (hexadecimal).
Character entities are abbreviated names for characters, such as
< for the less-than symbol.
Character entities are predefined in the DTDs of markup languages such
as HTML and XHMTL as a convenience to authors because they may be easier
to remember than Numeric Character References.
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access