Disruptive Analytics: Charting Your Strategy for Next-Generation Business Analytics

by Thomas W. Dinsmore
August 2016
Beginner to intermediate
262 pages
8h 21m
English
Apress

© Thomas W. Dinsmore 2016

Thomas W. Dinsmore, Disruptive Analytics, 10.1007/978-1-4842-1311-7_4

4. The Hadoop Ecosystem

Disrupting from Below

Thomas W. Dinsmore

(1) Newton, Massachusetts, USA

In 2003, Doug Cutting and Mike Cafarella struggled to build a web crawler to search and index the entire Internet. They needed a way to distribute the data over multiple machines, because there was too much data for a single machine.

To keep costs low, they wanted to use inexpensive commodity hardware. That meant they would need fault-tolerant software, so if any one machine failed, the system could continue to operate.

Early in their work, they ruled out using a relational database. Their data included diverse data structures and data types, without a predefined ...


ISBN: 9781484213117