Hour 2. Understanding Hadoop
What You’ll Learn in This Hour:
Background on big data and Hadoop
The basics of the Hadoop Distributed File System (HDFS)
An overview of YARN, Hadoop’s resource scheduler
How Spark is used with Hadoop
Big data and Hadoop are inextricably linked. Hadoop as a data storage and processing platform was a major reason ...