12. Moving Data Into and Out of Hadoop

This chapter covers the following topics:

Image Using HDFS commands to move data to and from Hadoop clusters

Image Using Sqoop to move data between Hadoop and relational databases

Image Ingesting external data with Apache Flume and Apache Kafka

In this chapter, I explain some of the most common ways to move data into and out of HDFS, such as using HDFS file and directory commands and the DistCp (Distributed Copy) tool, which ...

Get Expert Hadoop® Administration now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.