O'Reilly logo

Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

The textual integration process for unstructured data is applied in an iterative manner, just as is the structured ETL process. Figure 3.8 shows that textual integration is processed iteratively. First, one pass of integration is made at the text. The results are analyzed, the processing parameters are refined, and the integration processing is repeated on the same text. The refinements that are made are made entirely on the basis of the analysis that is done (or cannot be done) on the data that has been created. If an analysis cannot be done or is done incorrectly, then the parameters that shape the textual data are adjusted so that analysis can be done and can be done correctly. This iterative process continues until the analytical ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required