
Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon


Perhaps the most effective way to manage large volumes of data in the unstructured data warehouse environment is to keep unnecessary data out of the warehouse in the first place. It is true that the unstructured environment contains far more data than the structured environment, but not all of that unstructured data belongs in the unstructured data warehouse. Consider a large collection of emails. The relevant sections of some emails undoubtedly belong in the unstructured data warehouse, but other emails do not belong there at all. Therefore, one of the processes that emails pass through before being sent to Textual ETL is a relevancy filter. Only those ...
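To make the idea of a relevancy filter concrete, here is a minimal sketch in Python. The text does not say how relevance is decided, so the keyword matching below is purely an assumption for illustration, and the names `Email`, `RELEVANT_TERMS`, `is_relevant`, and `relevancy_filter` are hypothetical rather than part of Textual ETL.

```python
import re
from dataclasses import dataclass

# Hypothetical business terms; a real filter would use criteria
# supplied by the organization, which the text does not specify.
RELEVANT_TERMS = frozenset({"contract", "invoice", "claim", "warranty"})


@dataclass
class Email:
    sender: str
    subject: str
    body: str


def is_relevant(email, terms=RELEVANT_TERMS):
    """Return True if the subject or body mentions any relevant term."""
    text = f"{email.subject} {email.body}".lower()
    # Split into alphabetic words so trailing punctuation does not block a match.
    words = set(re.findall(r"[a-z']+", text))
    return not terms.isdisjoint(words)


def relevancy_filter(emails, terms=RELEVANT_TERMS):
    """Yield only the emails that should be passed on to later processing."""
    for email in emails:
        if is_relevant(email, terms):
            yield email


emails = [
    Email("a@example.com", "Lunch?", "Pizza on Friday"),
    Email("b@example.com", "Invoice 4471", "Please pay the attached invoice."),
]
kept = list(relevancy_filter(emails))
```

With these two sample messages, only the second one is kept. Writing the filter as a generator means messages are screened one at a time, so rejected emails never need to be held in memory or handed to later stages.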
