O'Reilly logo

Hadoop MapReduce v2 Cookbook - Second Edition by Thilina Gunarathne

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 1. Getting Started with Hadoop v2

In this chapter, we will cover the following recipes:

  • Setting up standalone Hadoop v2 on your local machine
  • Writing a WordCount MapReduce application, bundling it, and running it using Hadoop local mode
  • Adding a combiner step to the WordCount MapReduce program
  • Setting up HDFS
  • Setting up Hadoop YARN in a distributed cluster environment using Hadoop v2
  • Setting up Hadoop ecosystem in a distributed cluster environment using a Hadoop distribution
  • HDFS command-line file operations
  • Running the WordCount program in a distributed cluster environment
  • Benchmarking HDFS using DFSIO
  • Benchmarking Hadoop MapReduce using TeraSort

Introduction

We are living in the era of big data, where exponential growth of phenomena such as web, ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required