O'Reilly logo

Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

One of the guiding principles when reading unstructured data is that, once read, the unstructured data should not be read again; unless of course modifications have been made to the unstructured data. Rereading unstructured data when not necessary can be a colossal waste of resources.

There are many good ways to accomplish the objective of reading an unstructured document only once. Perhaps the most common approach is to use the metadata attached to the document to determine when the document was last updated. If there has been an update since the last time the document was considered or processed, then of course the document should be reprocessed. But many documents – once created – are never modified. In fact, with ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required