Azure HDInsight is a managed Hadoop distribution, developed in partnership with Hortonworks and Microsoft. It uses the Hortonworks Data Platform (HDP) Hadoop distribution, which means that HDInsight is entirely Apache Hadoop on Azure. It deploys and provisions managed Apache Hadoop clusters in the cloud on Windows or Linux machines, which is a unique capability. It provides the Hadoop Distributed File System (HDFS) for reliable data storage. It uses the MapReduce programming model to process, analyze, and report on data stored in distributed file systems. Because it is ...
© Vinit Yadav 2017
Vinit Yadav, Processing Big Data with Azure HDInsight, 10.1007/978-1-4842-2869-2_1
1. Big Data, Hadoop, and HDInsight
(1)Ahmedabad, Gujarat, India