Vinit Yadav, Processing Big Data with Azure HDInsight, 10.1007/978-1-4842-2869-2_8

8. Exploring Data with Spark

Vinit Yadav¹

(1)Ahmedabad, Gujarat, India

Apache Spark changed the landscape of big data and analytics when it came out. Developers welcomed it like nothing else. It quickly became the superstar from ascendant technology. It is one of the most active and contributing open source projects in the big data ecosystem. At the time of writing, there are more than 1000 contributors to the project. Many big data companies have started moving from MapReduce to Spark. And there is no single reason for them to do so. Spark provides improvements in handling data, and it is very easy to work with. Before Spark, if you wanted to do ...

Get Processing Big Data with Azure HDInsight: Building Real-World Big Data Systems on Azure HDInsight Using the Hadoop Ecosystem now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Processing Big Data with Azure HDInsight: Building Real-World Big Data Systems on Azure HDInsight Using the Hadoop Ecosystem by Vinit Yadav

8. Exploring Data with Spark

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly