March 2017
Beginner to intermediate
356 pages
7h 11m
English
So far, we have gone through the architecture of Spark and discussed RDDs in some detail. By the end of Chapter 2, Transformations and Actions with Spark RDDs, we had focused on PairRDDs and some of the transformations.
This chapter focuses on doing ETL with Apache Spark. We'll cover the following topics, which should help you take the next step with Apache Spark:
Let's get started!
ETL stands for Extraction, Transformation, and Loading. The term has been around for decades and denotes an industry-standard process for moving and transforming data ...