Book description
Hadoop in Practice collects 85 Hadoop examples and presents them in a problem/solution format. Each technique addresses a specific task you'll face, like querying big data using Pig or writing a log file loader. You'll explore each problem step by step, learning how to build and deploy that specific solution as well as the thinking that went into its design. As you work through the tasks, you'll find yourself growing more comfortable with Hadoop and at home in the world of big data.
About the Technology
Hadoop is an open source MapReduce platform designed to query and analyze data distributed across large clusters. Especially effective for big data systems, Hadoop powers mission-critical software at Apple, eBay, LinkedIn, Yahoo, and Facebook. It offers developers handy ways to store, manage, and analyze data.
About the Book
Hadoop in Practice collects 85 battle-tested examples and presents them in a problem/solution format. It balances conceptual foundations with practical recipes for key problem areas like data ingress and egress, serialization, and LZO compression. You'll explore each technique step by step, learning how to build a specific solution along with the thinking that went into it. As a bonus, the book's examples create a well-structured and understandable codebase you can tweak to meet your own needs.
What's Inside
- Conceptual overview of Hadoop and MapReduce
- 85 practical, tested techniques
- Real problems, real solutions
- How to integrate MapReduce and R
About the Reader
This book assumes you've already started exploring Hadoop and want concrete advice on how to use it in production.
About the Author
Alex Holmes is a senior software engineer with extensive expertise in solving big data problems using Hadoop. He has presented at JavaOne and Jazoon and is a technical lead at VeriSign.
Quotes
Interesting topics that tickle the creative brain.
- Mark Kemna, Brillig
Ties together the Hadoop ecosystem technologies.
- Ayon Sinha, Britely
Comprehensive … high-quality code samples.
- Chris Nauroth, The Walt Disney Company
Covers all of the variants of Hadoop, not just the Apache distribution.
- Ted Dunning, MapR Technologies
Charts a path to the future.
- Alexey Gayduk, Grid Dynamics
Table of contents
- Copyright
- Brief Table of Contents
- Table of Contents
- Preface
- Acknowledgments
- About this Book
- Part 1. Background and fundamentals
- Part 2. Data logistics
- Part 3. Big data patterns
- Part 4. Data science
- Part 5. Taming the elephant
- Appendix A. Related technologies
- Appendix B. Hadoop built-in ingress and egress tools
- Appendix C. HDFS dissected
- Appendix D. Optimized MapReduce join frameworks
- Index
- List of Figures
- List of Tables
- List of Examples
Product information
- Title: Hadoop in Practice
- Author(s): Alex Holmes
- Release date: October 2012
- Publisher(s): Manning Publications
- ISBN: 9781617290237