O'Reilly logo

Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications by Dursun Delen, Robert Nisbet, Thomas Hill, Andrew Fast, John Elder, Gary Miner

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 15

Focused Web Crawling

Contents

Preamble

Focused web crawling stands one step above the other techniques discussed in this book. It is an integrated technology that combines two base technologies: classification and web analytics. This combination of basic capabilities enables more complex decision making and provides information that is more specific to its intended use. Figure 15.1 shows a complete analysis system that “feeds” automatically on Internet data and outputs information that can be used to make ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required