O'Reilly logo

Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

After the iterative development plan has been developed, it is time to select a tool for Textual ETL. Each organization needs to determine its own criteria and priorities for a tool. Some of the criteria for the selection of the tool may include the ability to:

·         Handle large volumes of data

·         Create the data warehouse in a multiplicity of technologies

·         Read in and manage many different input sources

·         Do external categorization

·         Do sub document processing

·         Do basic editing, such as alternate spelling and data standardization

·         Do named value processing

·         Do proximity analysis and clustering

·         Operate in multiple languages

·         Handle ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required