January 2012
Intermediate to advanced
1000 pages
28h 4m
English
Contents
The Opportunities and Challenges of Mining the Web
Topic Hierarchies for Focused Crawling
Focused web crawling stands one step above the other techniques discussed in this book. It is an integrated technology that combines two base technologies: classification and web analytics. This combination of basic capabilities enables more complex decision making and provides information that is more specific to its intended use. Figure 15.1 shows a complete analysis system that “feeds” automatically on Internet data and outputs information that can be used to make ...
Read now
Unlock full access