More and more data-driven companies are looking to adopt stream processing and streaming analytics. With this concise ebook, you’ll learn best practices for designing a reliable architecture that supports this emerging big-data paradigm.
Authors Ted Dunning and Ellen Friedman (Real World Hadoop) help you explore some of the best technologies to handle stream processing and analytics, with a focus on the upstream queuing or message-passing layer. To illustrate the effectiveness of these technologies, this book also includes specific use cases.
Ideal for developers and non-technical people alike, this book describes:
- Key elements in good design for streaming analytics, focusing on the essential characteristics of the messaging layer
- New messaging technologies, including Apache Kafka and MapR Streams, with links to sample code
- Technology choices for streaming analytics: Apache Spark Streaming, Apache Flink, Apache Storm, and Apache Apex
- How stream-based architectures are helpful to support microservices
- Specific use cases such as fraud detection and geo-distributed data streams
Ted Dunning is Chief Applications Architect at MapR Technologies, and active in the open source community. He currently serves as VP for Incubator at the Apache Foundation, as a champion and mentor for a large number of projects, and as committer and PMC member of the Apache ZooKeeper and Drill projects. Ted is on Twitter as @ted_dunning.
Ellen Friedman, a committer for the Apache Drill and Apache Mahout projects, is a solutions consultant and well-known speaker and author, currently writing mainly about big data topics. With a PhD in Biochemistry, she has years of experience as a research scientist and has written about a variety of technical topics. Ellen is on Twitter as @Ellen_Friedman.
Table of contents
1. Why Stream?
- Planes, Trains, and Automobiles: Connected Vehicles and the IoT
- Streaming Data: Life As It Happens
- Beyond Real Time: More Benefits of Streaming Architecture
- Emerging Best Practices for Streaming Architectures
- Healthcare Example with Data Streams
- Streaming Data as a Central Aspect of Architectural Design
- 2. Stream-based Architecture
3. Streaming Architecture: Ideal Platform for Microservices
- Why Microservices Matter
- What Is Needed to Support Microservices
- Microservices in More Detail
- Designing a Streaming Architecture: Online Video Service Example
- Importance of a Universal Microarchitecture
- What’s in a Name?
- Why Use Distributed Files and NoSQL Databases?
- New Design for the Video Service
- Summary: The Converged Platform View
4. Kafka as Streaming Transport
- Motivations for Kafka
- Kafka Innovations
- Kafka Basic Concepts
- The Kafka APIs
- Kafka Utility Programs
- Kafka Gotchas
- 5. MapR Streams
- 6. Fraud Detection with Streaming Data
- 7. Geo-Distributed Data Streams
- 8. Putting It All Together
- A. Additional Resources
- Title: Streaming Architecture
- Release date: May 2016
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781491953921
You might also like
Making Sense of Stream Processing
How can event streams help make your application more scalable, reliable, and maintainable? In this report, …
Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd Edition
Through a series of recent breakthroughs, deep learning has boosted the entire field of machine learning. …
Architecting Modern Data Platforms
There’s a lot of information about big data technologies, but splicing these technologies into an end-to-end …
40 Algorithms Every Programmer Should Know
Learn algorithms for solving classic computer science problems with this concise guide covering everything from fundamental …