Apache Spark changed the landscape of big data and analytics when it came out. Developers welcomed it like nothing else. It quickly became the superstar from ascendant technology. It is one of the most active and contributing open source projects in the big data ecosystem. At the time of writing, there are more than 1000 contributors to the project. Many big data companies have started moving from MapReduce to Spark. And there is no single reason for them to do so. Spark provides improvements in handling data, and it is very easy to work with. Before Spark, if you wanted to do ...
© Vinit Yadav 2017
Vinit Yadav, Processing Big Data with Azure HDInsight, 10.1007/978-1-4842-2869-2_8
8. Exploring Data with Spark
Vinit Yadav1
(1)Ahmedabad, Gujarat, India
Get Processing Big Data with Azure HDInsight: Building Real-World Big Data Systems on Azure HDInsight Using the Hadoop Ecosystem now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.