Book description
Hadoop in Action teaches readers how to use Hadoop and write MapReduce programs. The intended readers are programmers, architects, and project managers who have to process large amounts of data offline. Hadoop in Action will lead the reader from obtaining a copy of Hadoop to setting it up in a cluster and writing data analytic programs. The book begins by making the basic idea of Hadoop and MapReduce easier to grasp by applying the default Hadoop installation to a few easy-to-follow tasks, such as analyzing changes in word frequency across a body of documents. The book continues through the basic concepts of MapReduce applications developed using Hadoop, including a close look at framework components, use of Hadoop for a variety of data analysis tasks, and numerous examples of Hadoop in action. Hadoop in Action will explain how to use Hadoop and present design patterns and practices of programming MapReduce. MapReduce is a complex idea both conceptually and in its implementation, and Hadoop users are challenged to learn all the knobs and levers for running Hadoop. This book takes you beyond the mechanics of running Hadoop, teaching you to write meaningful programs in a MapReduce framework. This book assumes the reader will have a basic familiarity with Java, as most code examples will be written in Java. Familiarity with basic statistical concepts (e.g. histogram, correlation) will help the reader appreciate the more advanced data processing examples.
Table of contents
- Copyright
- Brief Table of Contents
- Table of Contents
- Preface
- Acknowledgments
- About this Book
- Author Online
- About the author
- About the cover illustration
- Part 1. Hadoop—A Distributed Programming Framework
- Chapter 1. Introducing Hadoop
- Chapter 2. Starting Hadoop
- Chapter 3. Components of Hadoop
- Part 2. Hadoop in Action
- Chapter 4. Writing basic MapReduce programs
- Chapter 5. Advanced MapReduce
- Chapter 6. Programming Practices
- Chapter 7. Cookbook
- Chapter 8. Managing Hadoop
- Part 3. Hadoop Gone Wild
- Chapter 9. Running Hadoop in the cloud
- Chapter 10. Programming with Pig
- Chapter 11. Hive and the Hadoop herd
- Chapter 12. Case studies
- Appendix A. HDFS file commands
- Index
- List of Figures
- List of Tables
- List of Listings
Product information
- Title: Hadoop in Action
- Author(s):
- Release date: December 2010
- Publisher(s): Manning Publications
- ISBN: 9781935182191
You might also like
book
Learning Git
This book teaches Git in a simple, visual, and tangible manner so that you can build …
book
How Linux Works, 3rd Edition
Unlike some operating systems, Linux doesn’t try to hide the important bits from you—it gives you …
book
Tidy First?
Messy code is a nuisance. "Tidying" code, to make it more readable, requires breaking it up …
book
Hadoop: The Definitive Guide, 4th Edition
Get ready to unlock the power of your data. With the fourth edition of this comprehensive …