O'Reilly logo

Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Combining both the SDLC and spiral approaches leads to these eleven steps that will be described in detail in this section:

1.      Understand the business problem and business context

2.      Survey the data sources to determine which data is useful

3.      Select and customize taxonomies

4.      Select the initial set of data

5.      Determine future iterations and source document requirements

6.      Choose the textual ETL tool

7.      Load parameters for transformations

8.      Execute ETL scripts with initial set of data

9.      Examine results and make adjustments, if needed

10.  Execute ETL scripts on remaining iterations

11.  Continuous business analysis and make adjustments, if needed

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required