1

Introducing Big Data, Hadoop, and Spark

In pioneer days they used oxen for heavy pulling, and when one ox couldn’t budge a log, they didn’t try to grow a larger ox. We shouldn’t be trying for bigger computers, but for more systems of computers.

Rear Admiral Grace Murray Hopper, American computer scientist

In This Chapter:

Introduction to Big Data and the Apache Hadoop project

Basic overview of the Hadoop core components (HDFS and YARN)

Introduction ...

Get Data Analytics with Spark Using Python, First edition now with the O’Reilly learning platform.

O’Reilly members experience live online training, plus books, videos, and digital content from nearly 200 publishers.