Book description
Every enterprise application creates data, whether it’s log messages, metrics, user activity, outgoing messages, or something else. And how to move all of this data becomes nearly as important as the data itself. If you’re an application architect, developer, or production engineer new to Apache Kafka, this practical guide shows you how to use this open source streaming platform to handle real-time data feeds.
Engineers from Confluent and LinkedIn who are responsible for developing Kafka explain how to deploy production Kafka clusters, write reliable event-driven microservices, and build scalable stream-processing applications with this platform. Through detailed examples, you’ll learn Kafka’s design principles, reliability guarantees, key APIs, and architecture details, including the replication protocol, the controller, and the storage layer.
- Understand publish-subscribe messaging and how it fits in the big data ecosystem.
- Explore Kafka producers and consumers for writing and reading messages
- Understand Kafka patterns and use-case requirements to ensure reliable data delivery
- Get best practices for building data pipelines and applications with Kafka
- Manage Kafka in production, and learn to perform monitoring, tuning, and maintenance tasks
- Learn the most critical metrics among Kafka’s operational measurements
- Explore how Kafka’s stream delivery capabilities make it a perfect source for stream processing systems
Publisher resources
Table of contents
- Foreword
- Preface
- 1. Meet Kafka
- 2. Installing Kafka
- 3. Kafka Producers: Writing Messages to Kafka
-
4. Kafka Consumers: Reading Data from Kafka
- Kafka Consumer Concepts
- Creating a Kafka Consumer
- Subscribing to Topics
- The Poll Loop
- Configuring Consumers
- Commits and Offsets
- Rebalance Listeners
- Consuming Records with Specific Offsets
- But How Do We Exit?
- Deserializers
- Standalone Consumer: Why and How to Use a Consumer Without a Group
- Older Consumer APIs
- Summary
- 5. Kafka Internals
- 6. Reliable Data Delivery
- 7. Building Data Pipelines
- 8. Cross-Cluster Data Mirroring
- 9. Administering Kafka
- 10. Monitoring Kafka
- 11. Stream Processing
- A. Installing Kafka on Other Operating Systems
- Index
Product information
- Title: Kafka: The Definitive Guide
- Author(s):
- Release date: September 2017
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781491936160
You might also like
book
Designing Data-Intensive Applications
Data is at the center of many challenges in system design today. Difficult issues need to …
book
Modern Software Engineering: Doing What Works to Build Better Software Faster
Improve Your Creativity, Effectiveness, and Ultimately, Your Code In Modern Software Engineering, continuous delivery pioneer David …
book
Effective Java, 3rd Edition
Since this Jolt-award winning classic was last updated in 2008, the Java programming environment has changed …
book
The Staff Engineer's Path
For years, companies have rewarded their most effective engineers with management positions. But treating management as …