Overview
Why have stream-oriented data systems become so popular, when batch-oriented systems have served big data needs for many years? In the updated edition of this report, Dean Wampler examines the rise of streaming systems for handling time-sensitive problems—such as detecting fraudulent financial activity as it happens. You’ll explore the characteristics of fast data architectures, along with several open source tools for implementing them.
Batch processing isn’t going away, but exclusive use of these systems is now a competitive disadvantage. You’ll learn that, while fast data architectures using tools such as Kafka, Akka, Spark, and Flink are much harder to build, they represent the state of the art for dealing with mountains of data that require immediate attention.
- Learn how a basic fast data architecture works, step-by-step
- Examine how Kafka’s data backplane combines the best abstractions of log-oriented and message queue systems for integrating components
- Evaluate four streaming engines, including Kafka Streams, Akka Streams, Spark, and Flink
- Learn which streaming engines work best for different use cases
- Get recommendations for making real-world streaming systems responsive, resilient, elastic, and message driven
- Explore an example IoT streaming application that includes telemetry ingestion and anomaly detection
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access