Chapter 9

Text Mining and Natural Language Processing

OUTLINE

Preamble

Pattern recognition is the most basic description of what is done in the process of data mining. Usually, these patterns are stored in structured databases and organized into records, which are composed of rows and columns of data. The columns are attributes (numbers or text strings) associated with a table (entity), accessed by links between attributes among the tables (relations). This entity-relational structure of data is called a relational database. Large relational databases store huge quantities of data in data warehouses in ...

Get Handbook of Statistical Analysis and Data Mining Applications now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.