Hadoop 2.x Administration Cookbook by Gurmukh Singh

Hive performance tuning

In this recipe, we will cover Hive tuning by looking at some of its important parameters. Hive is a data warehousing solution that runs on top of Hadoop, as discussed in Chapter 7, Data Ingestion and Workflow. Refer to that chapter for the installation and configuration of Hive.

Getting ready

Make sure that you have a running cluster with Hive installed and configured to work with the ZooKeeper ensemble. Refer to Chapter 7, Data Ingestion and Workflow, for the steps to configure Hive.

How to do it...

  1. Connect to the Edge node client1.cyrus.com and switch to the hadoop user.
  2. If you have followed the previous recipes, Hive is installed at /opt/cluster/hive on the Edge node.
  3. The first step is to tune the JVM heap used when Hive ...
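
Before changing any settings, it can help to confirm that Hive really is where step 2 says it is. The following is a minimal sketch, not part of the original recipe: the function name check_hive_home is hypothetical, and it assumes the usual Hive layout in which the launcher script sits at bin/hive under the installation directory. Run it on the Edge node as the hadoop user.

```shell
#!/bin/sh
# Sketch: confirm the Hive installation referred to in step 2.
# Assumption: a Hive install directory contains an executable bin/hive.

# check_hive_home DIR (hypothetical helper)
# Prints a confirmation and returns 0 if DIR/bin/hive is executable;
# otherwise prints a message to stderr and returns 1.
check_hive_home() {
    dir="$1"
    if [ -x "$dir/bin/hive" ]; then
        echo "Hive found at $dir"
        return 0
    fi
    echo "Hive not found at $dir" >&2
    return 1
}

# /opt/cluster/hive is the path given in step 2; set HIVE_HOME to
# override it if your layout differs. "|| true" keeps the script from
# aborting when the check fails.
check_hive_home "${HIVE_HOME:-/opt/cluster/hive}" || true
```

If the check reports that Hive is not found, revisit the installation recipe in Chapter 7 before continuing with the tuning steps.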
