Skip to Content
Big Data
book

Big Data

by Fei Hu
April 2016
Beginner to intermediate
463 pages
18h 53m
English
Auerbach Publications
Content preview from Big Data
Challenges in Crawling the Deep Web
117
queries. If all orders exhibit a strong correlation, RankingReward(q
j
) is closer to 0. Note that
ˆ
δ
j
is calculated by the estimator in [25] and c
j
consists of network communication and bandwidth
consume that can measured by f
j
.
4.5 Discussions and Conclusions
In deep web crawling a query returns multiple documents that result in duplicates. Reducing
this redundancy is a unique problem in deep web crawling, and the source of the challenges
in deep web crawling. The major cost is the network traffic which could be measured by the
number of queries for small data sources. For large data sources such as online social ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Big Data

Big Data

Bernard Marr
Big Data

Big Data

Kuan-Ching Li, Hai Jiang, Laurence T. Yang, Alfredo Cuzzocrea
Big Data

Big Data

Eglantine Schmitt
Big Data

Big Data

James Warren, Nathan Marz

Publisher Resources

ISBN: 9781498734875