O'Reilly logo

Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Sub doc processing occurs when a document contains logical subdivisions that need to be recognized in the index. The first step in sub doc processing is that of defining the delimiter or delimiters that will specify a logical break in the document. The delimiter may be a standard character or a special character. In many cases, the delimiter is a string of characters.

The first pass through the data identifies where the delimiters are. With sub doc processing, a second pass through the data is required, during which the words that were indexed are divided into subdivisions based on the appearance of the delimiters.

Some documents have no logical sub structuring, while others do. When a document does have logical ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required