Hadoop Operations and Cluster Management Cookbook by Shumin Guo

Installing Pig

Like Hive, Pig is a handy tool for manipulating data stored in Hadoop. In this recipe, we will install Apache Pig.

Getting ready

Before we install Pig, we need to make sure Hadoop has been properly installed. Please refer to the previous sections about the configuration of a Hadoop cluster.
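One quick way to confirm that Hadoop is available before going further is to check that the hadoop command can be found and reports a version. The following is a minimal sketch, not part of the original recipe; check_hadoop is a hypothetical helper name, and it assumes the Hadoop bin directory is already on the PATH of the user running it.

```shell
# check_hadoop is a hypothetical helper, not part of Hadoop or Pig.
# It prints the first line of `hadoop version` if the command is on the PATH,
# and returns a non-zero status otherwise.
check_hadoop() {
    if command -v hadoop >/dev/null 2>&1; then
        hadoop version | head -n 1
    else
        echo "hadoop not found on PATH" >&2
        return 1
    fi
}
```

Running check_hadoop on the master node should print the installed Hadoop version; a non-zero return status suggests revisiting the Hadoop installation before installing Pig.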

Download the Pig archive file from a mirror site with the following command on the administrator machine:

wget http://www.motorlogy.com/apache/pig/stable/pig-0.10.1.tar.gz -P ~/repo
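Once the download finishes, it can be worth confirming that the archive is intact before copying it to the master node. The following is a minimal sketch under stated assumptions: verify_archive is a hypothetical helper, and the example path ~/repo/pig-0.10.1.tar.gz assumes the file was saved into the ~/repo directory.

```shell
# verify_archive is a hypothetical helper, not part of Pig or Hadoop.
# It checks that the file exists, is non-empty, and can be listed by tar
# as a gzip-compressed archive.
verify_archive() {
    archive="$1"
    if [ ! -s "$archive" ]; then
        echo "missing or empty: $archive" >&2
        return 1
    fi
    if ! tar -tzf "$archive" >/dev/null 2>&1; then
        echo "not a valid gzip tar archive: $archive" >&2
        return 1
    fi
    echo "ok: $archive"
}
```

For example, verify_archive ~/repo/pig-0.10.1.tar.gz prints an ok line when the archive can be read, and returns a non-zero status if the download was truncated or the mirror returned an error page instead of the file.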

How to do it...

Use the following steps to configure Pig:

  1. Log in to the master node from the Hadoop administrator machine as hduser with the following command:
    ssh hduser@master
    
  2. Copy the archive to /usr/local with the following command:
    sudo ...
