Skip to Content
Hadoop in Practice, Second Edition
book

Hadoop in Practice, Second Edition

by Alex Holmes
September 2014
Intermediate to advanced content levelIntermediate to advanced
512 pages
13h 54m
English
Manning Publications
Content preview from Hadoop in Practice, Second Edition

Part 3. Big data patterns

Now that you’ve gotten to know Hadoop and know how to best organize, move, and store your data in Hadoop, you’re ready to explore part 3 of this book, which examines the techniques you need to know to streamline your big data computations.

In chapter 6 we’ll examine techniques for optimizing MapReduce operations, such as joining and sorting on large datasets. These techniques make jobs run faster and allow for more efficient use of computational resources.

Chapter 7 examines how graphs can be represented and utilized in Map-Reduce to solve algorithms such as friends-of-friends and PageRank. It also covers how data structures such as Bloom filters and HyperLogLog can be used when regular data structures can’t scale to ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Hadoop in Action

Hadoop in Action

Chuck Lam
Mastering Hadoop 3

Mastering Hadoop 3

Timothy Wong, Chanchal Singh, Manish Kumar
Hadoop Application Architectures

Hadoop Application Architectures

Mark Grover, Ted Malaska, Jonathan Seidman, Gwen Shapira

Publisher Resources

ISBN: 9781617292224Supplemental ContentPublisher SupportOtherPublisher WebsiteErrata PageSupplemental ContentPurchase Link