Learning Real-time Processing with Spark Streaming by Sumit Gupta

Summary

In this chapter, we discussed in detail the packages, core components, and classes of Spark and its extensions. We also covered resilient distributed datasets (RDDs) and discretized streams (DStreams). Finally, we integrated and configured Spark Streaming with Flume and executed our distributed log file processing use case.

In the next chapter, we will apply various transformation functions to our streaming data and discuss performance tuning for our Spark Streaming application and cluster.
