O'Reilly logo

Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

In reality, there are many ways to build the unstructured data warehouse. By understanding the business objectives, the analyst/designer can make intelligent choices as to how to build the unstructured data warehouse. For example, if the source data is email, the designer will probably want to filter blather, do stop word processing, and create a fractured document. If the documents are contracts, the designer is likely to use named value processing and document fracturing. If the source is doctors’ notes, the designer will want to use homographic resolution and external categorization, as well as alternate spelling. The nature of the source and the ultimate usage of the documents dictate the kind of textual ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required