O'Reilly logo

Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

An important aspect of processing text is managing external categorization. External categorization is handled by managing text through classifications based on taxonomies. If you are going to be doing external category processing, then you will need taxonomies. But if you are not going to be doing external category processing, you will not need taxonomies.

The term “external category” refers to the fact that the contents of a body of text are analyzed in accordance with externally created criteria. Figure 8.7 shows that several external criteria, that is, taxonomies, are selected and that those criteria are applied against a body of text.

Figure 8.7 Applying taxonomies to a body of text

For example, the body of text ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required