July 2002
Intermediate to advanced
864 pages
22h 32m
English
The Internet is a collection of computer systems that communicate with each other in various protocols. Each of these computer systems contain documents that, in the form of HTML pages, can be easily distributed to other computers. These documents are also indexed by many modern search engines and by earlier systems such as the Wide Area Information Servers (WAIS). A number of initiatives exist to classify data to make the information more meaningful. The Internet can be thought of as the largest document repository in the world: a repository that users can leverage for themselves.
A number of problems exist with searching and indexing the content ...