Block Escapes
A block escape is a simple way to refer to a range of
characters that have some property in common. Each block escape has a
name; to use a block name, prepend Is
to it. Block escapes are used with the
\p
and \P
operators. For example, the expression
\p{IsThai}
refers to the Thai characters (฀
– ๿
). The expression \P{IsThai}
refers to everything except Thai
characters. The block names are listed here in the format defined in
the XML Schema spec.
Table E-1 shows the complete list of block escapes. This table was generated from version 5.0.0 of the file blocks.txt. The list of block escape names is part of the Unicode Character Database; see http://www.unicode.org/ for the latest version of the Unicode standard.
Block name | Starting character | Ending character |
BasicLatin | � |  |
Latin-1Supplement | € | ÿ |
LatinExtended-A | Ā | ſ |
LatinExtended-B | ƀ | ɏ |
IPAExtensions | ɐ | ʯ |
SpacingModifierLetters | ʰ | ˿ |
CombiningDiacriticalMarks | ̀ | ͯ |
GreekandCoptic | Ͱ | Ͽ |
Cyrillic | Ѐ | ӿ |
CyrillicSupplement | Ԁ | ԯ |
Armenian | ԰ | ֏ |
Hebrew | ֐ | ׿ |
Arabic | ؀ | ۿ |
Syriac | ܀ | ݏ |
ArabicSupplement | ݐ | ݿ |
Thaana | ހ | ޿ |
NKo | ߀ | ߿ |
Devanagari | ऀ | ॿ |
Bengali | ঀ | ৿ |
Gurmukhi | ਀ | ੿ |
Gujarati | ઀ | ૿ |
Oriya | ଀ | ୿ |
Tamil | ஀ | ௿ |
Telugu | ఀ | ౿ |
Kannada | ಀ | ೿ |
Malayalam | ഀ | ൿ |
Sinhala | ඀ | ෿ |
Thai | ฀ | ๿ |
Lao | ຀ ... |
Get XSLT, 2nd Edition now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.