O'Reilly logo

Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

A second approach to the handling of large volumes of data is to use the iterative, step at a time, approach in the editing, loading, and handling of data. The importance of iteration has been emphasized several times in this book. There are many reasons why doing processing in an iterative manner makes sense. The main reason why it makes sense is that rework is usually the norm for processing text. It is unusual to specify a process and have the process operate as expected the first time (or even the second time). When handling text, rework should be a common, normal expectation because much of the handling of text is an art, not a science. Text is imperfect and therefore handling text is also imperfect.

Knowing ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required