O'Reilly logo

Hadoop 2.x Administration Cookbook by Gurmukh Singh

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Loading data into Hive

In this recipe, we look at how we can import data into Hive and also how we can point it to existing data using an external table.

The data store formats for Hive can be text, ORC and parquet, as well as a few other formats. Each one has its advantages in terms of compression, performance, space utilization and memory overheads.

Getting ready

To progress through the recipe, you must have completed the recipe Using MySQL for Hive metastore. There are many examples of each type of Hive distribution at $HIVE_HOME/examples.

How to do it...

  1. Connect to the edge node edge1.cyrus.com in the cluster and switch to the hadoop user.
  2. Connect by either using Hive or the beeline client and import the data by creating a table as shown in the ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required