O'Reilly logo

Practical Data Science with Hadoop® and Spark: Designing and Building Effective Analytics at Scale by Douglas Eadline, Casey Stella, Ofer Mendelevitch

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

4. Getting Data into Hadoop

You can have data without information, but you cannot have information without data.

Daniel Keys Moran

In This Chapter:

Images The data lake concept is presented as a new data processing paradigm.

Images Basic methods for importing CSV data into HDFS and Hive tables are presented.

Images Additional methods for using Spark to import data into Hive tables or directly for a Spark job are presented.

Apache Sqoop is introduced as a tool for ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required