book

Big Data

April 2016

Beginner to intermediate

463 pages

18h 53m

English

Read now

Unlock full access

Content preview from Big Data

112 

Big Data: Storage, Sharing, and Security

The purpose of [9] is to crawl over all entity documents in deep web data sources, such

as the documents containing product names and other attributes. Since the authors have all

query logs of Google, the query logs (its format is < query,url

clicked

,times

clicked

>)toward

a target data source are collected and only queries that are clicked for at least two times are

considered. Then the relevant entity names are extracted from the satisﬁed log queries. The

extraction is based on the Freebase data [37] that provides 22 million entity names. Finally, all

extracted entity names will be sent to the target data ...

Bernard Marr

Kuan-Ching Li, Hai Jiang, Laurence T. Yang, Alfredo Cuzzocrea

Eglantine Schmitt

James Warren, Nathan Marz

ISBN: 9781498734875