Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

Leveraging Taxonomies

In preparation for executing textual ETL, you will need to select and prepare the taxonomies that will be used. A taxonomy, in its simplest form, is a list of categories or synonyms. Taxonomies are necessary for the creation of the “meta” layer of text used to address terminology.
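As a rough illustration of the "list of categories or synonyms" idea, a taxonomy can be held as a mapping from a category to the terms that fall under it, and then used to tag raw text with its categories. The sketch below is an assumption made for illustration, not the book's textual ETL implementation: the names `MANUFACTURING_TAXONOMY`, `build_lookup`, and `tag_terms`, along with the sample categories and terms, are all hypothetical.

```python
import re

# Hypothetical taxonomy (illustrative only): category -> terms in that category.
MANUFACTURING_TAXONOMY = {
    "fastener": ["bolt", "screw", "rivet"],
    "process": ["welding", "casting", "milling"],
}


def build_lookup(taxonomy):
    """Invert the taxonomy into a lowercased term -> category lookup."""
    return {
        term.lower(): category
        for category, terms in taxonomy.items()
        for term in terms
    }


def tag_terms(text, taxonomy):
    """Return (term, category) pairs for each taxonomy term found in the text.

    Matching is a simple case-insensitive, whole-word comparison; real
    textual ETL would also need to handle phrases, stemming, and so on.
    """
    lookup = build_lookup(taxonomy)
    words = re.findall(r"[A-Za-z]+", text.lower())
    return [(word, lookup[word]) for word in words if word in lookup]


if __name__ == "__main__":
    sample = "The bolt was replaced after welding."
    print(tag_terms(sample, MANUFACTURING_TAXONOMY))
```

The resulting term and category pairs are one simple way to picture the "meta" layer: the raw word stays as written, while the category gives different documents a shared vocabulary to be queried by.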

If you are going to use taxonomies, select the ones most germane to the unstructured data warehouse you are building. Suppose you are building an unstructured data warehouse for manufacturing control. You will probably need taxonomies on the product being manufactured and on the manufacturing process. You will almost certainly not need taxonomies on code of ethics, Sarbanes-Oxley, recreational activities, and ...
