Book description
Learn the fundamentals of HBase administration and development with the help of real-time scenarios
In Detail
Apache HBase is a nonrelational NoSQL database management system that runs on top of HDFS. It is an open source, distributed, versioned, column-oriented store. It facilitates the tech industry with random, real-time read/write access to your Big Data with the benefit of linear scalability on the fly.
This book will take you through a series of core tasks in HBase. The introductory chapter will give you all the information you need about the HBase ecosystem. Furthermore, you'll learn how to configure, create, verify, and test clusters. The book also explores different parameters of Hadoop and HBase that need to be considered for optimization and a trouble-free operation of the cluster. It will focus more on HBase's data model, storage, and structure layout. You will also get to know the different options that can be used to speed up the operation and functioning of HBase. The book will also teach the users basic- and advance-level coding in Java for HBase. By the end of the book, you will have learned how to use HBase with large data sets and integrate them with Hadoop.
What You Will Learn
- Understand the fundamentals of HBase
- Understand the prerequisites necessary to get started with HBase
- Install and configure a new HBase cluster
- Optimize an HBase cluster using different Hadoop and HBase parameters
- Make clusters more reliable using different troubleshooting and maintenance techniques
- Get to grips with the HBase data model and its operations
- Get to know the benefits of using Hadoop tools/JARs for HBase
Table of contents
-
Learning HBase
- Table of Contents
- Learning HBase
- Credits
- About the Author
- Acknowledgments
- About the Reviewers
- www.PacktPub.com
- Preface
-
1. Understanding the HBase Ecosystem
- HBase layout on top of Hadoop
- Comparing architectural differences between RDBMs and HBase
- HBase features
- HBase in the Hadoop ecosystem
- Comparing functional differences between RDBMs and HBase
- About the internal storage architecture of HBase
- Getting started with HBase
- Applications of HBase
- HBase pros and cons
- Summary
- 2. Let's Begin with HBase
-
3. Let's Start Building It
- Downloading Java on Ubuntu
- Considering host configurations
- Installing and configuring SSH
- Installing and configuring NTP
- Performing capacity planning
- Installing and configuring Hadoop
- Hadoop start up steps
- Configuring Apache HBase
- Installing and configuring ZooKeeper
- Installing Cloudera Hadoop and HBase
- Installing the Hadoop and MapReduce packages
- Installing Hadoop on Windows
- Summary
- 4. Optimizing the HBase/Hadoop Cluster
- 5. The Storage, Structure Layout, and Data Model of HBase
-
6. HBase Cluster Maintenance and Troubleshooting
- Hadoop shell commands
- HBase shell commands
- HBase administration tools
- Writing HBase shell scripts
- Using the Hadoop tool or JARs for HBase
- Connecting HBase with Hive
- HBase region management
- HBase node management
- Implementing security
- Troubleshooting the most frequent HBase errors and their explanations
- Summary
- 7. Scripting in HBase
- 8. Coding HBase in Java
- 9. Advance Coding in Java for HBase
- 10. HBase Use Cases
- Index
Product information
- Title: Learning HBase
- Author(s):
- Release date: November 2014
- Publisher(s): Packt Publishing
- ISBN: 9781783985944
You might also like
book
Apache HBase Primer
Learn the fundamental foundations and concepts of the Apache HBase (NoSQL) open source database. It covers …
book
HBase Essentials
A practical guide to realizing the seamless potential of storing and managing high-volume, high-velocity data quickly …
book
Apache Hadoop 3 Quick Start Guide
A fast paced guide that will help you learn about Apache Hadoop 3 and its ecosystem …
book
HBase in Action
HBase in Action has all the knowledge you need to design, build, and run applications using …