Book description
The go-to guidebook for deploying Big Data solutions with Hadoop
Today's enterprise architects need to understand how the Hadoop frameworks and APIs fit together, and how they can be integrated to deliver real-world solutions. This book is a practical, detailed guide to building and implementing those solutions, with code-level instruction in the popular Wrox tradition. It covers storing data with HDFS and HBase, processing data with MapReduce, and automating data processing with Oozie. Hadoop security, running Hadoop with Amazon Web Services, best practices, and automating Hadoop processes in real time are also covered in depth.
With in-depth code examples in Java and XML and the latest on recent additions to the Hadoop ecosystem, this complete resource also covers the use of APIs, exposing their inner workings and allowing architects and developers to better leverage and customize them.
- The ultimate guide for developers, designers, and architects who need to build and deploy Hadoop applications
- Covers storing and processing data with various technologies, automating data processing, Hadoop security, and delivering real-time solutions
- Includes detailed, real-world examples and code-level guidelines
- Explains when, why, and how to use these tools effectively
- Written by a team of Hadoop experts in the programmer-to-programmer Wrox style
Professional Hadoop Solutions is the reference enterprise architects and developers need to maximize the power of Hadoop.
Table of contents
- Cover
- Contents
- Chapter 1: Big Data and the Hadoop Ecosystem
- Chapter 2: Storing Data in Hadoop
- Chapter 3: Processing Your Data with MapReduce
- Chapter 4: Customizing MapReduce Execution
  - Controlling MapReduce Execution with InputFormat
  - Reading Data Your Way with Custom RecordReaders
  - Organizing Output Data with Custom Output Formats
  - Writing Data Your Way with Custom RecordWriters
  - Optimizing Your MapReduce Execution with a Combiner
  - Controlling Reducer Execution with Partitioners
  - Using Non-Java Code with Hadoop
  - Summary
- Chapter 5: Building Reliable MapReduce Apps
- Chapter 6: Automating Data Processing with Oozie
- Chapter 7: Using Oozie
  - Validating Information about Places Using Probes
  - Designing Place Validation Based on Probes
  - Designing Oozie Workflows
  - Implementing Oozie Workflow Applications
  - Implementing Workflow Activities
  - Implementing Oozie Coordinator Applications
  - Implementing Oozie Bundle Applications
  - Deploying, Testing, and Executing Oozie Applications
  - Using the Oozie Console to Get Information about Oozie Applications
  - Summary
- Chapter 8: Advanced Oozie Features
- Chapter 9: Real-Time Hadoop
- Chapter 10: Hadoop Security
- Chapter 11: Running Hadoop Applications on AWS
- Chapter 12: Building Enterprise Security Solutions for Hadoop Implementations
- Chapter 13: Hadoop’s Future
- Appendix: Useful Reading
- Introduction
- Advertisements
Product information
- Title: Professional Hadoop Solutions
- Author(s):
- Release date: September 2013
- Publisher(s): Wrox
- ISBN: 9781118611937