Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

DW 2.0 specifies many different unstructured components. In one form or another, these components can be placed or built in the unstructured data warehouse. Some of the important components include:

· Simple pointer data. A simple pointer discloses where an unstructured word or phrase occurs in the source text. As an example, the word “hathaway” is found in doc “rty”, word 12998.

· Textual subjects. This includes both external and internal subjects. As an example, the words in the doc “yut” all center around liver cancer.

· Captured text. This is where the actual text is brought over from the unstructured environment. For example, email “bnn1226” ...
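
The components listed above can be pictured as simple record types. The following is a minimal sketch, not a structure prescribed by DW 2.0: the class names, field names, the `build_pointers` helper, the sample sentence, the "internal" label, and the placeholder email body are all assumptions made for illustration. Only the identifiers “rty”, “yut”, and “bnn1226” and the word “hathaway” come from the examples in the text.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class SimplePointer:
    """A word plus the place it occurs in the source text (hypothetical layout)."""
    word: str
    doc_id: str
    word_position: int  # 1-based position of the word within the document


@dataclass(frozen=True)
class TextualSubject:
    """A subject that the words of a document center around (hypothetical layout)."""
    doc_id: str
    subject: str
    origin: str  # "internal" or "external"; the label values are assumed


@dataclass(frozen=True)
class CapturedText:
    """Actual text brought over from the unstructured environment."""
    source_id: str
    text: str


def build_pointers(doc_id: str, text: str) -> list[SimplePointer]:
    """Split text into words and record one pointer per word occurrence."""
    words = re.findall(r"[A-Za-z0-9']+", text)
    return [
        SimplePointer(word.lower(), doc_id, position)
        for position, word in enumerate(words, start=1)
    ]


# Invented sample sentence; the real document "rty" is not available here.
pointers = build_pointers("rty", "Anne Hathaway lived in Stratford")

# Whether this subject is internal or external is not stated in the text.
subject = TextualSubject("yut", "liver cancer", "internal")

# Placeholder body; the actual content of the email is not given.
captured = CapturedText("bnn1226", "placeholder body text")
```

In this sketch a pointer keeps only the location of a word, while captured text carries the content itself, which mirrors the distinction the list draws between referencing the source and bringing the text over.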
