Processing Big Data with Azure HDInsight: Building Real-World Big Data Systems on Azure HDInsight Using the Hadoop Ecosystem

by Vinit Yadav

Released May 2017

Publisher(s): Apress

ISBN: 9781484228685

Start your free trial

Book description

Get a jump start on using Azure HDInsight and Hadoop Ecosystem components. As most Hadoop and Big Data projects are written in either Java, Scala, or Python, this book minimizes the effort to learn another language and is written from the perspective of a .NET developer. Hadoop components are covered, including Hive, Pig, HBase, Storm, and Spark on Azure HDInsight, and code samples are written in .NET only.

Processing Big Data with Azure HDInsight covers the fundamentals of big data, how businesses are using it to their advantage, and how Azure HDInsight fits into the big data world. This book introduces Hadoop and big data concepts and then dives into creating different solutions with HDInsight and the Hadoop Ecosystem. It covers concepts with real-world scenarios and code examples, making sure you get hands-on experience. The best way to utilize this book is to practice while reading. After reading this book you will be familiar with Azure HDInsight and how it can be utilized to build big data solutions, including batch processing, stream analytics, interactive processing, and storing and retrieving data in an efficient manner.

What You'll Learn

Understand the fundamentals of HDInsight and Hadoop
Work with HDInsight cluster
Query with Apache Hive and Apache Pig
Store and retrieve data with Apache HBase
Stream data processing using Apache Storm
Work with Apache Spark

Who This Book Is For

Software developers, technical architects, data scientists/analyts, and Hadoop administrators who want to develop on Microsoft's managed Hadoop offering, HDInsight

Product information

Title: Processing Big Data with Azure HDInsight: Building Real-World Big Data Systems on Azure HDInsight Using the Hadoop Ecosystem
Author(s): Vinit Yadav
Release date: May 2017
Publisher(s): Apress
ISBN: 9781484228685

book

Sams Teach Yourself: Big Data Analytics with Microsoft HDInsight in 24 Hours, Big Data, Hadoop, and Microsoft Azure for Better Business Intelligence

by Manpreet Singh, Arshad Ali

This is the Rough Cut version of the printed book. With The world of data is …

video

Learn Hadoop and Azure HDInsight Basics this Evening (in 2 hours)

by Eshant Garg

The Apache Hadoop is a framework that allows for the distributed processing of large data sets …

video

Creating an extensible 100+ PB real-time big data platform by unifying storage and serving

by Reza Shiftehfar

Uber relies heavily on making data-driven decisions in every product area and needs to store and …

video

SQL Server 2019 Big Data Clusters Crash Course: Installing and Using a Big Data Cluster for Data Analysis

by Buck Woody

Learn the architecture and implementation of Microsoft’s latest SQL Server capability - Big Data Clusters. Combine …

Processing Big Data with Azure HDInsight: Building Real-World Big Data Systems on Azure HDInsight Using the Hadoop Ecosystem

Book description

Table of contents

Product information

You might also like

Sams Teach Yourself: Big Data Analytics with Microsoft HDInsight in 24 Hours, Big Data, Hadoop, and Microsoft Azure for Better Business Intelligence

Learn Hadoop and Azure HDInsight Basics this Evening (in 2 hours)

Creating an extensible 100+ PB real-time big data platform by unifying storage and serving

SQL Server 2019 Big Data Clusters Crash Course: Installing and Using a Big Data Cluster for Data Analysis

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly