Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

The primary architectural consideration in managing large volumes of text in an unstructured data warehouse is to keep the raw text itself out of the warehouse. Only the text that is most useful in decision making is placed there. Stated differently, the unstructured data warehouse should contain only the distilled data that is useful for decision making, while the actual text remains at its source.

As a simple example, consider this email:

I received your invoice for $238.18 yesterday. I will see to it that AM Rogers is paid posthaste. Thank you for the opportunity to do business with you.

  Todd Beltham

The email would remain in ...
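
The idea of keeping the full text at its source and storing only distilled data can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' design: the record layout (`DistilledRecord`), the `distill_email` helper, the dollar-amount pattern, and the `mail-store://` source reference are all hypothetical names chosen for this example, and the excerpt does not say which fields the book would actually extract.

```python
import re
from dataclasses import dataclass
from decimal import Decimal
from typing import Optional


@dataclass(frozen=True)
class DistilledRecord:
    """Hypothetical warehouse row: a pointer to the source plus distilled fields."""
    source_ref: str                     # assumed locator for where the full email lives
    sender: str
    invoice_amount: Optional[Decimal]   # None when no dollar amount is found


# Assumed pattern: a dollar sign followed by digits, optional commas, optional cents.
AMOUNT_PATTERN = re.compile(r"\$(\d[\d,]*(?:\.\d{2})?)")


def distill_email(source_ref: str, sender: str, body: str) -> DistilledRecord:
    """Extract the decision-relevant fields; the body itself is not stored."""
    match = AMOUNT_PATTERN.search(body)
    amount = Decimal(match.group(1).replace(",", "")) if match else None
    return DistilledRecord(source_ref=source_ref, sender=sender, invoice_amount=amount)


EMAIL_BODY = (
    "I received your invoice for $238.18 yesterday. I will see to it that "
    "AM Rogers is paid posthaste. Thank you for the opportunity to do "
    "business with you."
)

record = distill_email("mail-store://example/message-001", "Todd Beltham", EMAIL_BODY)
print(record)
```

The record carries only a reference to the email and the extracted values, so the warehouse stays small while an analyst can still follow `source_ref` back to the original message when the full wording matters.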