In 2003, Doug Cutting and Mike Cafarella were struggling to build a web crawler that could search and index the entire Internet. There was far too much data for any single machine to handle, so they needed a way to distribute the data across many machines.
To keep costs low, they wanted to use inexpensive commodity hardware. That meant they would need fault-tolerant software, so if any one machine failed, the system could continue to operate.
Early in their work, they ruled out using a relational database. Their data included diverse data structures and data types, without a predefined schema.