Book description
Let Hadoop For Dummies help harness the power of your data and rein in the information overload
Big data has become big business, and companies and organizations of all sizes are struggling to find ways to retrieve valuable information from their massive data sets without becoming overwhelmed. Enter Hadoop and this easy-to-understand For Dummies guide. Hadoop For Dummies helps readers understand the value of big data, make a business case for using Hadoop, navigate the Hadoop ecosystem, and build and manage Hadoop applications and clusters.
- Explains the origins of Hadoop, its economic benefits, and its functionality and practical applications
- Helps you find your way around the Hadoop ecosystem, program MapReduce, utilize design patterns, and get your Hadoop cluster up and running quickly and easily
- Details how to use Hadoop applications for data mining, web analytics and personalization, large-scale text processing, data science, and problem-solving
- Shows you how to improve the value of your Hadoop cluster, maximize your investment in Hadoop, and avoid common pitfalls when building your Hadoop cluster
From programmers challenged with building and maintaining affordable, scalable data systems to administrators who must deal with huge volumes of information effectively and efficiently, this how-to has something to help you with Hadoop.
Table of contents
- Introduction
Part I: Getting Started with Hadoop
- Chapter 1: Introducing Hadoop and Seeing What It’s Good For
- Chapter 2: Common Use Cases for Big Data in Hadoop
- Chapter 3: Setting Up Your Hadoop Environment
Part II: How Hadoop Works
- Chapter 4: Storing Data in Hadoop: The Hadoop Distributed File System
- Chapter 5: Reading and Writing Data
- Chapter 6: MapReduce Programming
- Chapter 7: Frameworks for Processing Data in Hadoop: YARN and MapReduce
- Chapter 8: Pig: Hadoop Programming Made Easier
- Chapter 9: Statistical Analysis in Hadoop
- Chapter 10: Developing and Scheduling Application Workflows with Oozie
Part III: Hadoop and Structured Data
- Chapter 11: Hadoop and the Data Warehouse: Friends or Foes?
- Chapter 12: Extremely Big Tables: Storing Data in HBase
- Chapter 13: Applying Structure to Hadoop Data with Hive
- Chapter 14: Integrating Hadoop with Relational Databases Using Sqoop
- Chapter 15: The Holy Grail: Native SQL Access to Hadoop Data
Part IV: Administering and Configuring Hadoop
- Chapter 16: Deploying Hadoop
Chapter 17: Administering Your Hadoop Cluster
- Achieving Balance: A Big Factor in Cluster Health
- Mastering the Hadoop Administration Commands
- Understanding Factors for Performance
- Tolerating Faults and Data Reliability
- Putting Apache Hadoop’s Capacity Scheduler to Good Use
- Setting Security: The Kerberos Protocol
- Expanding Your Toolset Options
- Basic Hadoop Configuration Details
Part V: The Part of Tens
Chapter 18: Ten Hadoop Resources Worthy of a Bookmark
- Central Nervous System: Apache.org
- Tweet This
- Hortonworks University
- Cloudera University
- BigDataUniversity.com
- planet Big Data Blog Aggregator
- Quora’s Apache Hadoop Forum
- The IBM Big Data Hub
- Conferences Not to Be Missed
- The Google Papers That Started It All
- The Bonus Resource: What Did We Ever Do B.G.?
Chapter 19: Ten Reasons to Adopt Hadoop
- Hadoop Is Relatively Inexpensive
- Hadoop Has an Active Open Source Community
- Hadoop Is Being Widely Adopted in Every Industry
- Hadoop Can Easily Scale Out As Your Data Grows
- Traditional Tools Are Integrating with Hadoop
- Hadoop Can Store Data in Any Format
- Hadoop Is Designed to Run Complex Analytics
- Hadoop Can Process a Full Data Set (As Opposed to Sampling)
- Hardware Is Being Optimized for Hadoop
- Hadoop Can Increasingly Handle Flexible Workloads (No Longer Just Batch)
- About the Authors
- Cheat Sheet
- More Dummies Products
Product information
- Title: Hadoop For Dummies
- Author(s):
- Release date: April 2014
- Publisher(s): For Dummies
- ISBN: 9781118607558