Book description
Hadoop in Practice, Second Edition provides over 100 tested, instantly useful techniques that will help you conquer big data, using Hadoop. This revised new edition covers changes and new features in the Hadoop core architecture, including MapReduce 2. Brand new chapters cover YARN and integrating Kafka, Impala, and Spark SQL with Hadoop. You'll also get new and updated techniques for Flume, Sqoop, and Mahout, all of which have seen major new versions recently. In short, this is the most practical, up-to-date coverage of Hadoop available anywhere
About the Technology
About the Book
It's always a good time to upgrade your Hadoop skills! Hadoop in Practice, Second Edition provides a collection of 104 tested, instantly useful techniques for analyzing real-time streams, moving data securely, machine learning, managing large-scale clusters, and taming big data using Hadoop. This completely revised edition covers changes and new features in Hadoop core, including MapReduce 2 and YARN. You'll pick up hands-on best practices for integrating Spark, Kafka, and Impala with Hadoop, and get new and updated techniques for the latest versions of Flume, Sqoop, and Mahout. In short, this is the most practical, up-to-date coverage of Hadoop available.
Readers need to know a programming language like Java and have basic familiarity with Hadoop.
What's Inside
- Thoroughly updated for Hadoop 2
- How to write YARN applications
- Integrate real-time technologies like Storm, Impala, and Spark
- Predictive analytics using Mahout and RR
About the Reader
About the Author
Alex Holmes works on tough big-data problems. He is a software engineer, author, speaker, and blogger specializing in large-scale Hadoop projects.
Quotes
Very insightful. A deep dive into the Hadoop world.
- Andrea Tarocchi, Red Hat, Inc.
The most complete material on Hadoop and its ecosystem known to mankind!
- Arthur Zubarev, Vital Insights
Clear and concise, full of insights and highly applicable information.
- Edward de Oliveira Ribeiro, DataStax, Inc.
Comprehensive up-to-date coverage of Hadoop 2.
- Muthusamy Manigandan, OzoneMedia
Publisher resources
Table of contents
- Copyright
- Brief Table of Contents
- Table of Contents
- Praise for the First Edition of Hadoop in Practice
- Preface
- Acknowledgments
- About this Book
- About the Cover Illustration
- Part 1. Background and fundamentals
- Part 2. Data logistics
- Part 3. Big data patterns
- Part 4. Beyond MapReduce
- Appendix. Installing Hadoop and friends
- Index
- List of Figures
- List of Tables
- List of Listings
Product information
- Title: Hadoop in Practice, Second Edition
- Author(s):
- Release date: September 2014
- Publisher(s): Manning Publications
- ISBN: 9781617292224
You might also like
book
Hadoop in Action
Hadoop in Action introduces the subject and teaches you how to write programs in the MapReduce …
book
Hadoop Application Architectures
Get expert guidance on architecting end-to-end data management solutions with Apache Hadoop. While many sources explain …
book
Hadoop For Dummies
Let Hadoop For Dummies help harness the power of your data and rein in the information …
book
Hadoop Operations
If you’ve been asked to maintain large and complex Hadoop clusters, this book is a must. …