Book description
“This book is a critically needed resource for the newly released Apache Hadoop 2.0, highlighting YARN as the significant breakthrough that broadens Hadoop beyond the MapReduce paradigm.”
From the Foreword by Raymie Stata, CEO of Altiscale
The Insider’s Guide to Building Distributed, Big Data Applications with Apache Hadoop™ YARN
Apache Hadoop is helping drive the Big Data revolution. Now, its data processing has been completely overhauled: Apache Hadoop YARN provides resource management at data center scale and easier ways to create distributed applications that process petabytes of data. In Apache Hadoop™ YARN, two Hadoop technical leaders show you how to develop new applications and adapt existing code to take full advantage of these advances.
YARN project founder Arun Murthy and project lead Vinod Kumar Vavilapalli demonstrate how YARN increases scalability and cluster utilization, enables new programming models and services, and opens new options beyond Java and batch processing. They walk you through the entire YARN project lifecycle, from installation through deployment.
You’ll find many examples drawn from the authors’ cutting-edge experience—first as Hadoop’s earliest developers and implementers at Yahoo! and now as Hortonworks developers moving the platform forward and helping customers succeed with it.
Coverage includes
- YARN’s goals, design, architecture, and components, and how it expands the Apache Hadoop ecosystem
- Exploring YARN on a single node
- Administering YARN clusters and Capacity Scheduler
- Running existing MapReduce applications
- Developing a large-scale clustered YARN application
- Discovering new open source frameworks that run under YARN
Table of contents
- About This eBook
- Title Page
- Copyright Page
- Contents
- Foreword by Raymie Stata
- Foreword by Paul Dix
- Preface
- Acknowledgments
- About the Authors
- 1. Apache Hadoop YARN: A Brief History and Rationale
- 2. Apache Hadoop YARN Install Quick Start
- 3. Apache Hadoop YARN Core Concepts
- 4. Functional Overview of YARN Components
- 5. Installing Apache Hadoop YARN
- 6. Apache Hadoop YARN Administration
- 7. Apache Hadoop YARN Architecture Guide
- 8. Capacity Scheduler in YARN
- 9. MapReduce with Apache Hadoop YARN
- 10. Apache Hadoop YARN Application Example
- 11. Using Apache Hadoop YARN Distributed-Shell
- 12. Apache Hadoop YARN Frameworks
- A. Supplemental Content and Code Downloads
- B. YARN Installation Scripts
- C. YARN Administration Scripts
- D. Nagios Modules
- E. Resources and Additional Information
- F. HDFS Quick Reference
- Index
Product information
- Title: Apache Hadoop™ YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop™ 2
- Author(s):
- Release date: March 2014
- Publisher(s): Addison-Wesley Professional
- ISBN: 9780133441925