Appendix 1What Is Unicode?

Throughout this book, you’ll encounter the word “Unicode” in several recipes. So, what exactly is Unicode?

Unicode is a character encoding system that provides a consistent way of encoding, processing, and displaying written texts. Put simply, an encoding system assigns numbers to characters, which can then be translated into binary language used by computers.

Unicode is implemented in all modern operation systems and programming languages. And it plays an increasingly important role in the JavaScript language. Prior to the invention of Unicode, there were hundreds of different character encoding systems, most of which were severely limited in size and scope and incompatible with one another.

The most widely used ...

Get Text Processing with JavaScript now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.