O'Reilly logo

Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Search engines use automated software programs to survey the Web and build their databases. Web documents are retrieved by these programs and analyzed. Data collected from each web page is added to the search engine index. When you enter a query at a search engine site, the input data is matched against the search engine's index of all the web pages it has analyzed. The best results are then returned to the user as hits, ranked in order with the best results at the top.

Legacy search depends on keyword searching. The most common form of text search on the Web, search engines do their text query and retrieval using keywords.

What is a keyword? It can simply be any word on a webpage. For example, if we use the word “complex" making ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required