Book description
Every enterprise application creates data, whether it’s log messages, metrics, user activity, outgoing messages, or something else. And how to move all of this data becomes nearly as important as the data itself. If you’re an application architect, developer, or production engineer new to Apache Kafka, this practical guide shows you how to use this open source streaming platform to handle real-time data feeds.
Engineers from Confluent and LinkedIn who are responsible for developing Kafka explain how to deploy production Kafka clusters, write reliable event-driven microservices, and build scalable stream-processing applications with this platform. Through detailed examples, you’ll learn Kafka’s design principles, reliability guarantees, key APIs, and architecture details, including the replication protocol, the controller, and the storage layer.
- Understand publish-subscribe messaging and how it fits in the big data ecosystem.
- Explore Kafka producers and consumers for writing and reading messages
- Understand Kafka patterns and use-case requirements to ensure reliable data delivery
- Get best practices for building data pipelines and applications with Kafka
- Manage Kafka in production, and learn to perform monitoring, tuning, and maintenance tasks
- Learn the most critical metrics among Kafka’s operational measurements
- Explore how Kafka’s stream delivery capabilities make it a perfect source for stream processing systems
Publisher resources
Table of contents
- Foreword
- Preface
- 1. Meet Kafka
- 2. Installing Kafka
- 3. Kafka Producers: Writing Messages to Kafka
-
4. Kafka Consumers: Reading Data from Kafka
- Kafka Consumer Concepts
- Creating a Kafka Consumer
- Subscribing to Topics
- The Poll Loop
- Configuring Consumers
- Commits and Offsets
- Rebalance Listeners
- Consuming Records with Specific Offsets
- But How Do We Exit?
- Deserializers
- Standalone Consumer: Why and How to Use a Consumer Without a Group
- Older Consumer APIs
- Summary
- 5. Kafka Internals
- 6. Reliable Data Delivery
- 7. Building Data Pipelines
- 8. Cross-Cluster Data Mirroring
- 9. Administering Kafka
- 10. Monitoring Kafka
- 11. Stream Processing
- A. Installing Kafka on Other Operating Systems
- Index
Product information
- Title: Kafka: The Definitive Guide
- Author(s):
- Release date: September 2017
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781491936160
You might also like
book
Kafka: The Definitive Guide, 2nd Edition
Every enterprise application creates data, whether it consists of log messages, metrics, user activity, or outgoing …
book
Kafka in Action
Master the wicked-fast Apache Kafka streaming platform through hands-on examples and real-world projects. In Kafka in …
book
Mastering Kafka Streams and ksqlDB
Working with unbounded and fast-moving data streams has historically been difficult. But with Kafka Streams and …
book
Kafka Connect
Used by more than 80% of Fortune 100 companies, Apache Kafka has become the de facto …