12. Moving Data Into and Out of Hadoop

This chapter covers the following topics:

Using HDFS commands to move data to and from Hadoop clusters

Using Sqoop to move data between Hadoop and relational databases

Ingesting external data with Apache Flume and Apache Kafka

In this chapter, I explain some of the most common ways to move data into and out of HDFS, such as using HDFS file and directory commands and the DistCp (Distributed Copy) tool, which ...
