Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

There have been attempts in the past to understand unstructured text linguistically. This approach is called the “natural language approach” (or the “NLP” approach).[5] While the linguistic approach has its proponents, the approach described in this book is decidedly nonlinguistic. It can best be described as the “thematic” approach to reducing text to an analyzable form and format. In the thematic approach, each word is treated as merely a unit of data in a database. The words can then be organized (that is, clustered) according to themes.
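To make the idea of "a word as a unit of data" concrete, here is a minimal Python sketch. It is only an illustration under stated assumptions, not the method the book goes on to develop: the theme vocabulary `THEMES`, the row layout `(doc_id, position, word)`, and the function names `words_as_rows` and `cluster_by_theme` are all hypothetical and do not come from the text.

```python
import re
from collections import defaultdict

# Hypothetical theme vocabulary (an assumption for illustration only).
THEMES = {
    "finance": {"invoice", "payment", "refund"},
    "shipping": {"delivery", "package", "courier"},
}


def words_as_rows(doc_id, text):
    """Reduce text to database-style rows: one (doc_id, position, word) per word."""
    words = re.findall(r"[a-z']+", text.lower())
    return [(doc_id, position, word) for position, word in enumerate(words)]


def cluster_by_theme(rows, themes=THEMES):
    """Group word rows under every theme whose vocabulary contains the word.

    No grammar or sentence context is consulted; each word is matched
    purely as a data value, which is the nonlinguistic point of the sketch.
    """
    clusters = defaultdict(list)
    for doc_id, position, word in rows:
        for theme, vocabulary in themes.items():
            if word in vocabulary:
                clusters[theme].append((doc_id, position, word))
    return dict(clusters)


rows = words_as_rows("d1", "The payment for the package was late")
clusters = cluster_by_theme(rows)
```

In this sketch, words that belong to no theme stay in `rows` but appear in no cluster, so the original word-level data remains available for other kinds of analysis.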

The NLP approach requires that words and language be understood through the context in which the words appear. ...
