Image    23 Web Retrieval and Mining

Carlos Castillo and Ricardo Baeza-Yates

CONTENTS

Introduction

Web Search

Web Crawling

Indexing

Querying and Ranking

Relevance

Quality

Ranking Manipulation

Web Mining

Content Mining

Link Mining

Usage Mining

Conclusions and Current Trends

References

Bibliography

INTRODUCTION

Information retrieval is the area of computer science concerned with the representation, storage, organization, and access to documents.

Documents, in this definition, are understood in a broad sense, and include Web pages and other contents available on the Web. The Web is an unique medium for information dissemination, characterized by low entry ...

Get Understanding Information Retrieval Systems now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.