Chapter 10: Implementing Real-Time Streaming with Amazon EMR and Spark Streaming
In Chapter 3, Common Use Cases and Architecture Patterns, we discussed different use cases and architecture patterns that you can follow using Amazon EMR, while in Chapter 9,Implementing Batch ETL Pipeline with Amazon EMR and Apache Spark, you learned how you can implement a batch Extract, Transform, and Load (ETL) pipeline using Amazon EMR and PySpark script.
In this chapter, we will dive deep into another use case – real-time streaming with Amazon EMR and Spark Streaming, where we will look at the implementation steps that you can follow to replicate the setup in your AWS account.
Real-time streaming use cases are becoming more popular as distributed processing ...
Get Simplify Big Data Analytics with Amazon EMR now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.