Book description
Learn how to use the Apache Hadoop projects, including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout, and Apache Solr. From setting up the environment to running sample applications each chapter in this book is a practical tutorial on using an Apache Hadoop ecosystem project.
While several books on Apache Hadoop are available, most are based on the main projects, MapReduce and HDFS, and none discusses the other Apache Hadoop ecosystem projects and how they all work together as a cohesive big data development platform.
What You Will Learn:
Set up the environment in Linux for Hadoop projects using Cloudera Hadoop Distribution CDH 5
Run a MapReduce job
Store data with Apache Hive, and Apache HBase
Index data in HDFS with Apache Solr
Develop a Kafka messaging system
Stream Logs to HDFS with Apache Flume
Transfer data from MySQL database to Hive, HDFS, and HBase with Sqoop
Create a Hive table over Apache Solr
Develop a Mahout User Recommender System
Who This Book Is For:
Apache Hadoop developers. Pre-requisite knowledge of Linux and some knowledge of Hadoop is required.
Product information
- Title: Practical Hadoop Ecosystem: A Definitive Guide to Hadoop-Related Frameworks and Tools
- Author(s):
- Release date: October 2016
- Publisher(s): Apress
- ISBN: 9781484221990
You might also like
book
Hadoop Real-World Solutions Cookbook - Second Edition
Over 90 hands-on recipes to help you learn and master the intricacies of Apache Hadoop 2.X, …
book
Hadoop MapReduce v2 Cookbook - Second Edition
Explore the Hadoop MapReduce v2 ecosystem to gain insights from very large datasets In Detail Starting …
book
Hadoop 2 Quick-Start Guide: Learn the Essentials of Big Data Computing in the Apache Hadoop 2 Ecosystem
Get Started Fast with Apache Hadoop ® 2, YARN, and Today’s Hadoop Ecosystem With Hadoop 2.x …
book
Hadoop 2.x Administration Cookbook
Over 100 practical recipes to help you become an expert Hadoop administrator About This Book Become …