O'Reilly logo

Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

If punctuation can become an impediment to successful analysis of unstructured data, so may its font. If a query looks at a specific font literally, then the query may miss many legitimate instances where a hit should have been made but wasn’t because of a mismatch of fonts.

For example, a query is done for all instances of “Wimbledon”, where Wimbledon is in Arial font. The analysis would miss all instances of “Wimbledon” where the text is not in Arial font – even an instance Wimbledon” in Arial Black font would not be retrieved. Beware of font-sensitive search engines.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required