
Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon


Now the processing parameters for Textual ETL are created. These parameters may include such things as:

· External categorizations
· Stop word processing
· Alternate spelling
· Homograph resolution
· Clustering
· Proximity analysis
· Pattern searching, and so forth.
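To make the list above concrete, here is a minimal sketch of how such a parameter set might be represented. It is only an illustration: the class name `TextualEtlParams`, its field names, and the sample values are all hypothetical and do not come from any actual Textual ETL product. Homograph resolution and clustering are left out for brevity.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class TextualEtlParams:
    """Hypothetical parameter set; every field name here is illustrative."""

    # External categorization: maps a term to a broader category.
    external_categories: Dict[str, str] = field(default_factory=dict)
    # Stop word processing: words dropped before analysis.
    stop_words: Set[str] = field(default_factory=set)
    # Alternate spelling: maps a variant spelling to the preferred one.
    alternate_spellings: Dict[str, str] = field(default_factory=dict)
    # Proximity analysis: maximum distance, in tokens, between related words.
    proximity_window: int = 5
    # Pattern searching: regular expressions to look for in the text.
    patterns: List[str] = field(default_factory=list)


# Sample values, invented purely for demonstration.
params = TextualEtlParams(
    external_categories={"oak": "wood", "pine": "wood"},
    stop_words={"a", "and", "the", "of"},
    alternate_spellings={"colour": "color"},
    patterns=[r"\b\d{3}-\d{4}\b"],
)
```

Keeping the parameters in one object makes it easy to adjust a single setting and rerun the same documents.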

Once the parameters are specified, the execution panel is prepared, the queue for processing is established, and processing commences. It is highly unusual for the first setting of parameters to be correct. Text has so many nuances and so many possibilities that an accurate initial parametric setting is almost unheard of. Instead, parameters are set, a few documents are run, the results ...
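Running a few documents against a first set of parameters might look like the sketch below. It is a simplified assumption, not the actual Textual ETL engine: the function `trial_run`, its arguments, and the sample documents are hypothetical, and only stop word processing and alternate spelling are applied.

```python
import re


def trial_run(documents, stop_words, alternate_spellings, sample_size=3):
    """Apply two illustrative parameters to a small sample of documents.

    Returns one list of surviving tokens per sampled document.
    """
    results = []
    for text in documents[:sample_size]:
        # Naive tokenization: lowercase runs of letters and apostrophes.
        tokens = re.findall(r"[a-z']+", text.lower())
        # Alternate spelling: replace each variant with its preferred form.
        tokens = [alternate_spellings.get(t, t) for t in tokens]
        # Stop word processing: drop words that carry little meaning.
        tokens = [t for t in tokens if t not in stop_words]
        results.append(tokens)
    return results


# Sample documents and parameter values, invented for demonstration.
docs = ["The colour of the oak table", "A pine desk and a chair"]
sample = trial_run(docs, {"a", "and", "the", "of"}, {"colour": "color"})
```

Limiting the run to a small sample keeps each trial quick, so the output can be checked by eye before a larger batch is queued.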
