Integrating Storm with HBase

As explained in earlier chapters, Storm is meant for real-time data processing. However, in most cases, you will need to store the processed data in a data store so that you can use the stored data for further analysis and can execute the analysis query on the data stored. This section explains how you can store the data processed by Storm in HBase.

HBase is a NoSQL, multidimensional, sparse, horizontal scalable database modeled after Google BigTable. HBase is built on top Hadoop, which means it relies on Hadoop and integrates with the MapReduce framework very well. Hadoop provides the following benefits to HBase.

  • A distributed data store that runs on top of commodity hardware
  • Fault tolerance

We will assume that you have ...

Get Learning Storm now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.