Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

Search engines return results with page or relevancy rankings, listing the hits according to how closely each result matches the search input. Yet the results often leave users perplexed and frustrated.

Why does this happen? Most often, search engines use the frequency with which the search terms occur within a document as the primary way of determining whether the document is relevant. If the search input is "colon cancer" and the word "colon" appears multiple times in a document, the search engine determines the document to be relevant to the search and positions it near the top of the result list. If the keyword is a commonly used phrase or has multiple other meanings, users could end up with a lot of irrelevant ...
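
The frequency-based ranking described above can be illustrated with a minimal sketch. It assumes the simplest possible scoring, a raw count of query-term occurrences, which is not how any particular search engine is implemented. The function names (`tokenize`, `tf_score`, `rank`) and the two sample documents are hypothetical and exist only for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tf_score(query, document):
    """Naive relevance: total occurrences of the query terms in the document."""
    counts = Counter(tokenize(document))
    return sum(counts[term] for term in tokenize(query))


def rank(query, documents):
    """Return document names ordered from highest to lowest naive score."""
    return sorted(
        documents,
        key=lambda name: tf_score(query, documents[name]),
        reverse=True,
    )


# Hypothetical sample documents (assumption: invented for this sketch).
documents = {
    "oncology": "Screening detects colon cancer early.",
    "grammar": (
        "Use a colon before a list. A colon can also follow a greeting. "
        "The colon is not a semicolon."
    ),
}

ranking = rank("colon cancer", documents)
```

In this sketch the punctuation guide outranks the medical text for the query "colon cancer" simply because it repeats the word "colon" more often, which is the kind of irrelevant hit the passage describes.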
