O'Reilly logo

Apache Accumulo for Developers by Guðmundur Jón Halldórsson

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Setting up Hadoop

Hadoop is a Java application framework and is designed to run on a large cluster of inexpensive hardware. As Hadoop is written in Java, it requires a working Java 1.6.x installation. Both SSH and SSHD must be running to use the Hadoop scripts remotely. For Windows installation, Cygwin is required. If Hadoop is already installed and running, you can skip this section.

SSH configuration

Hadoop uses SSH access to manage its nodes, both remote and local machines. Even if we only want to set up a local development box, we need to configure SSH access. To simplify, we should create a dedicated Hadoop user (we are going to do this for ZooKeeper and Accumulo in later sections of this chapter).

Creating a Hadoop user

A Hadoop user can be ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required