© Deepak Vohra 2016

Deepak Vohra, Apache HBase Primer, 10.1007/978-1-4842-2424-3_15

15. Defining the Row Keys

Deepak Vohra

(1)Apt 105, White Rock, British Columbia, Canada

The primary data access pattern is by row key. No design-time way to specify row keys exists because to HBase they are simply byte arrays. When designing for optimal read performance , it is important to first understand the read path.

  1. The read request is made by a client.

  2. HBase identifies the files that store the rows.

  3. The block index in each file identifies the block in which the row is found.

  4. HBase performs a scan to fetch all the key/value pairs for the request.

  5. A copy is stored in the block cache in memory before a row is returned to the client. Block cache stores the data in ...

Get Apache HBase Primer now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.