O'Reilly logo

HBase in Action: Foreword by Michael Stack by Nick Dimiduk and Amandeep Khurana

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 3. Distributed HBase, HDFS, and MapReduce

 

This chapter covers
  • HBase as a distributed storage system
  • When to use MapReduce instead of the key-value API
  • MapReduce concepts and workflow
  • How to write MapReduce applications with HBase
  • How to use HBase for map-side joins in MapReduce
  • Examples of using HBase with MapReduce

 

As you’ve realized, HBase is built on Apache Hadoop. What may not yet be clear to you is why. Most important, what benefits do we, as application developers, enjoy from this relationship? HBase depends on Hadoop for two separate concerns. Hadoop MapReduce provides a distributed computation framework for high-throughput data access. The Hadoop Distributed File System (HDFS) gives HBase a storage layer providing availability ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required