In this chapter, we have discussed the challenges and programming paradigms for batch and real-time data processing. We also discussed the need for frameworks like Spark and its differences with preexisting frameworks like Hadoop. At the end, we developed and deployed our first Spark Streaming program.
In the next chapter, we will talk about Spark Client APIs and some of its integral components. We will also introduce a real-world use case for distributed data processing in real time.