Chapter 2 Information Extraction Using SAS Crawler

Introduction to Information Extraction and Organization

SAS Crawler

SAS Search and Indexing

SAS Information Retrieval Studio Interface

Web Crawler

Breadth First

Depth First

Web Crawling: Real-World Applications and Examples

Understanding Core Component Servers

Proxy Server

Pipeline Server

Component Servers of SAS Search and Indexing

Indexing Server

Query Server

Query Web Server

Query Statistics Server

SAS Markup Matcher Server

Summary

References

Introduction to Information Extraction and Organization

In a little less than two decades, we have witnessed how the Internet has evolved and become an integral part of a common man’s life. The emergence of many e-commerce websites and social media channels ...

Get Text Mining and Analysis now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.