
Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

Step 4 in building an unstructured data warehouse is to choose a very small subset of data with which to begin development. If the data resides on paper, an OCR (optical character recognition) process must be initiated first.

The purpose of starting with a small subset is to ensure that any insurmountable or unexpected problems with the data are recognized as soon as possible. The subset should be small but representative of the range of data that will be brought into the unstructured data warehouse.
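One way to keep a subset both small and representative is to draw a few documents from every kind of source rather than sampling the whole collection uniformly. The following is a minimal sketch of that idea, not a method taken from the book: the function name `select_pilot_subset`, the `source` field, and the sample inventory are all hypothetical and chosen only for illustration.

```python
import random
from collections import defaultdict


def select_pilot_subset(documents, key, per_group=2, seed=0):
    """Pick up to `per_group` documents from each group.

    `key` maps a document to its group (for example, its source type),
    so that rare kinds of data are not crowded out by common ones.
    """
    rng = random.Random(seed)  # fixed seed keeps the pilot reproducible

    # Bucket the documents by group.
    groups = defaultdict(list)
    for doc in documents:
        groups[key(doc)].append(doc)

    # Sample a handful from each bucket, taking everything if a bucket is tiny.
    subset = []
    for group_key in sorted(groups):
        members = groups[group_key]
        subset.extend(rng.sample(members, min(per_group, len(members))))
    return subset


# Hypothetical inventory: many emails, some scanned contracts, one memo.
documents = (
    [{"id": f"email-{i}", "source": "email"} for i in range(500)]
    + [{"id": f"contract-{i}", "source": "scanned_paper"} for i in range(40)]
    + [{"id": "memo-0", "source": "memo"}]
)

pilot = select_pilot_subset(documents, key=lambda d: d["source"], per_group=2)
```

With this made-up inventory the pilot holds five documents yet still touches all three sources, including the single memo that a uniform random sample of the same size would most likely miss.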

Certainly not all potential problems will be discovered in this step, but if there are large, unforeseen problems waiting, they should be identified as soon as possible. ...
