7. The Unstructured Database

The foundation of textual analytics is the ability to access and analyze unstructured data in an unfettered manner. To achieve this state, an infrastructure suitable for analytical processing must be created. The heart of that environment is the unstructured database.

The term unstructured database (or unstructured data warehouse) is something of an oxymoron. “Unstructured” refers to the complete lack of discipline in the original creation of the text. “Database” implies that the contents are structured. Saying unstructured database is therefore like saying unstructured structured data. As contradictory as this seems, this is, in fact, ...
