CHAPTER 9 THE FUTURE OF WEB MINING

The Web continues to grow, but the pace has slackened since the early years (1994–1999). There is a relatively steady flux and turnover. Search engines started from their IR ancestors but made a substantial technological leap, as we have seen in this book. Other operations on hypertextual documents, such as crawling, clustering, and classification, have also been enhanced by the research described here.

Information foraging on the Web is now vastly easier than in the initial years of crawlers and search engines, but it is running up against the “syntactic search” barrier. Large search engines rarely get into in-depth linguistic analysis of document collections because many processes in automatic language processing ...

Get Mining the Web now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.