Book description
Master HBase configuration and administration for optimum database performance with this book and ebook.
- Move large amounts of data into HBase and learn how to manage it efficiently
- Set up HBase on the cloud, get it ready for production, and run it smoothly with high performance
- Maximize the ability of HBase with the Hadoop eco-system including HDFS, MapReduce, Zookeeper, and Hive
In Detail
As an Open Source distributed big data store, HBase scales to billions of rows, with millions of columns and sits on top of the clusters of commodity machines. If you are looking for a way to store and access a huge amount of data in real-time, then look no further than HBase.
HBase Administration Cookbook provides practical examples and simple step-by-step instructions for you to administrate HBase with ease. The recipes cover a wide range of processes for managing a fully distributed, highly available HBase cluster on the cloud. Working with such a huge amount of data means that an organized and manageable process is key and this book will help you to achieve that.
The recipes in this practical cookbook start from setting up a fully distributed HBase cluster and moving data into it. You will learn how to use all of the tools for day-to-day administration tasks as well as for efficiently managing and monitoring the cluster to achieve the best performance possible. Understanding the relationship between Hadoop and HBase will allow you to get the best out of HBase so the book will show you how to set up Hadoop clusters, configure Hadoop to cooperate with HBase, and tune its performance.
Table of contents
-
HBase Administration Cookbook
- Table of Contents
- HBase Administration Cookbook
- Credits
- About the Author
- Acknowledgement
- About the Reviewers
- www.PacktPub.com
- Preface
- 1. Setting Up HBase Cluster
- 2. Data Migration
-
3. Using Administration Tools
- Introduction
- HBase Master web UI
- Using HBase Shell to manage tables
- Using HBase Shell to access data in HBase
- Using HBase Shell to manage the cluster
- Executing Java methods from HBase Shell
- Row counter
- WAL tool—manually splitting and dumping WALs
- HFile tool—viewing textualized HFile content
- HBase hbck—checking the consistency of an HBase cluster
- Hive on HBase—querying HBase using a SQL-like language
- 4. Backing Up and Restoring HBase Data
-
5. Monitoring and Diagnosis
- Introduction
- Showing the disk utilization of HBase tables
- Setting up Ganglia to monitor an HBase cluster
- OpenTSDB—using HBase to monitor an HBase cluster
- Setting up Nagios to monitor HBase processes
- Using Nagios to check Hadoop/HBase logs
- Simple scripts to report the status of the cluster
- Hot region—write diagnosis
-
6. Maintenance and Security
- Introduction
- Enabling HBase RPC DEBUG-level logging
- Graceful node decommissioning
- Adding nodes to the cluster
- Rolling restart
- Simple script for managing HBase processes
- Simple script for making deployment easier
- Kerberos authentication for Hadoop and HBase
- Configuring HDFS security with Kerberos
- HBase security configuration
-
7. Troubleshooting
- Introduction
- Troubleshooting tools
- Handling the XceiverCount error
- Handling the "too many open files" error
- Handling the "unable to create new native thread" error
- Handling the "HBase ignores HDFS client configuration" issue
- Handling the ZooKeeper client connection error
- Handling the ZooKeeper session expired error
- Handling the HBase startup error on EC2
- 8. Basic Performance Tuning
-
9. Advanced Configurations and Tuning
- Introduction
- Benchmarking HBase cluster with YCSB
- Increasing region server handler count
- Precreating regions using your own algorithm
- Avoiding update blocking on write-heavy clusters
- Tuning memory size for MemStores
- Client-side tuning for low latency systems
- Configuring block cache for column families
- Client side scanner setting
- Tuning block size to improve seek performance
- Enabling Bloom Filter to improve the overall throughput
- Index
Product information
- Title: HBase Administration Cookbook
- Author(s):
- Release date: August 2012
- Publisher(s): Packt Publishing
- ISBN: 9781849517140
You might also like
book
HBase: The Definitive Guide
If you're looking for a scalable storage solution to accommodate a virtually endless amount of data, …
video
Introduction to Apache HBase Operations
HBase master Jonathan Hsieh provides a complete overview of Apache HBase operations in this course designed …
book
HBase High Performance Cookbook
Exciting projects that will teach you how complex data can be exploited to gain maximum insights …
book
HBase in Action
HBase in Action has all the knowledge you need to design, build, and run applications using …