The Hadoop Performance Myth

by Courtney Webster
April 2016
Content level: Intermediate to advanced
15 pages
27m
English
O'Reilly Media, Inc.

Overview

The wish lists of many data-driven organizations seem reasonable enough. They’d like to capitalize on real-time data analysis, move beyond batch processing for time-critical insights, allow multiple users to share cluster resources, and provide predictable service levels. However, fundamental performance limitations of complex distributed systems such as Hadoop prevent much of this from happening.

In this report, Courtney Webster examines the root cause of these performance problems and explains why best practices for mitigating them (cluster tuning, provisioning, and even cluster isolation for mission-critical jobs) don’t provide viable, scalable, or long-term solutions.

Organizations have been pushing Hadoop and other distributed systems to their performance breaking points as they seek to use clusters as shared resources across multiple business units and individual users. Once they hit this performance wall, companies will find it difficult to deliver on the big data promise at scale.

Read this report to find out what the implications are for your organization.

Publisher Resources

ISBN: 9781492042532