Book description
Pro Microsoft HDInsight is a complete guide to deploying and using Apache Hadoop on the Microsoft Windows Azure platform. The information in this book enables you to process enormous volumes of structured and unstructured data easily using HDInsight, which is Microsoft's own distribution of Apache Hadoop. Furthermore, the blend of Infrastructure as a Service (IaaS) and Platform as a Service (PaaS) offerings available through Windows Azure lets you take advantage of Hadoop's processing power without the worry of creating, configuring, maintaining, or managing your own cluster.
With a data explosion on the way, the open source Apache Hadoop framework is gaining traction, and it benefits from a huge ecosystem that has grown around the core functionality of the Hadoop Distributed File System (HDFS™) and Hadoop MapReduce. Pro Microsoft HDInsight equips you with the knowledge, confidence, and technique to configure and manage this ecosystem on Windows Azure. The book is an excellent choice for anyone aspiring to be a data scientist or data engineer, putting you a step ahead in the data mining field.
- Guides you through installation and configuration of an HDInsight cluster on Windows Azure
- Provides clear examples of configuring and executing MapReduce jobs
- Helps you consume data and diagnose errors from the Windows Azure HDInsight Service
What you'll learn
- Create and manage HDInsight clusters on Windows Azure
- Understand the different HDInsight services and configuration files
- Develop and run MapReduce jobs using .NET and PowerShell
- Consume data from client applications like Microsoft Excel and Power View
- Monitor job executions and logs
- Troubleshoot common problems
Who this book is for
Pro Microsoft HDInsight: Hadoop on Windows is an excellent choice for developers in the field of business intelligence and predictive analysis who want an extra edge in technology on the Microsoft Windows and Windows Azure platforms. The book is for people who love to slice and dice data and to identify trends and patterns through data analysis in support of creative and intelligent decision making.
Table of contents
- Title Page
- Dedication
- Contents at a Glance
- Contents
- About the Author
- About the Technical Reviewers
- Acknowledgments
- Introduction
- CHAPTER 1: Introducing HDInsight
- CHAPTER 2: Understanding Windows Azure HDInsight Service
- CHAPTER 3: Provisioning Your HDInsight Service Cluster
- CHAPTER 4: Automating HDInsight Cluster Provisioning
- CHAPTER 5: Submitting Jobs to Your HDInsight Cluster
- CHAPTER 6: Exploring the HDInsight Name Node
- CHAPTER 7: Using Windows Azure HDInsight Emulator
- CHAPTER 8: Accessing HDInsight over Hive and ODBC
- CHAPTER 9: Consuming HDInsight from Self-Service BI Tools
- CHAPTER 10: Integrating HDInsight with SQL Server Integration Services
- CHAPTER 11: Logging in HDInsight
- CHAPTER 12: Troubleshooting Cluster Deployments
- CHAPTER 13: Troubleshooting Job Failures
- Index
Product information
- Title: Pro Microsoft HDInsight: Hadoop on Windows
- Author(s):
- Release date: February 2014
- Publisher(s): Apress
- ISBN: 9781430260554