Skip to Main Content
Building an Intelligent Web: Theory and Practice
book

Building an Intelligent Web: Theory and Practice

by Pawan Lingras, Rajendra Akerkar
March 2010
Intermediate to advanced content levelIntermediate to advanced
326 pages
12h 25m
English
Jones & Bartlett Learning
Content preview from Building an Intelligent Web: Theory and Practice
“4137X˙CH08˙Akerkar” 2007/9/14 15:28 page 318 #30
318 CHAPTER 8 Web Structure Mining
8.2.4.2 Uniform URL Sampling
In the paper “On Near-Uniform URL Sampling” (2000), Henzinger and her colleagues intro-
duced another sampling approach. It provides a nearly uniform sample of the Web. The ability
to choose a URL uniformly at random allows us to estimate some web properties. These prop-
erties include the percentage of pages in a domain, the percentage of pages on a topic, and the
comparison of the index size of various search engines.
Research in uniform URL sampling involves random walks over a sample of well-connected
pages. The metho ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Reinventing the Organization for GenAI and LLMs

Reinventing the Organization for GenAI and LLMs

Ethan Mollick

Publisher Resources

ISBN: 9780763741372