Hive on HBase—querying HBase using a SQL-like language

HBase supports several interfaces to access data in its tables, such as the following:

  • HBase Shell
  • Java Client API
  • REST, Thrift, and Avro

HBase Shell is straightforward to use, but it is too limited for complex queries. The other interfaces require programming, which makes them unsuitable for ad hoc queries.

As data keeps growing, people might want an easy way to analyze the large amounts of data stored in HBase. The analysis should be efficient and ad hoc, and it should not require programming. Hive is currently the best approach for this purpose.

Hive is a data warehouse infrastructure built for Hadoop. It is used for ad hoc querying and analysis of large data sets without having to write a MapReduce program. ...
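
To make the idea concrete, here is a minimal HiveQL sketch of how an existing HBase table might be exposed to Hive and then queried. It relies on Hive's HBase storage handler and assumes the Hive HBase handler and HBase client libraries are available to Hive. The table name hly_temp, the column family n, and the columns v01 and v02 are hypothetical placeholders, not names taken from this recipe.

```sql
-- Hypothetical names: an existing HBase table 'hly_temp' with column family 'n'.
-- Replace them with the names of a real HBase table before running.
CREATE EXTERNAL TABLE hbase_hly_temp (key STRING, v01 STRING, v02 STRING)
STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
WITH SERDEPROPERTIES ("hbase.columns.mapping" = ":key,n:v01,n:v02")
TBLPROPERTIES ("hbase.table.name" = "hly_temp");

-- Ad hoc queries against the mapped table; no hand-written MapReduce program.
SELECT key, v01 FROM hbase_hly_temp LIMIT 10;
SELECT COUNT(*) FROM hbase_hly_temp;
```

In this sketch, :key maps the HBase row key to the first Hive column, and each family:qualifier entry maps to the next Hive column in order. Declaring the table as EXTERNAL means that dropping it in Hive leaves the underlying HBase table in place.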
