Chapter 12. Transactions, but Not as We Know Them

Kafka ships with built-in transactions, in much the same way that most relational databases do. The implementation is quite different, as we will see, but the goal is similar: to ensure that our programs create predictable and repeatable results, even when things fail.

Transactions do three important things in a services context:

  • They remove duplicates, which cause many streaming operations to get incorrect results (even something as simple as a count).

  • They allow groups of messages to be sent, atomically, to different topics: for example, Order Confirmed and Decrease Stock Level, which would leave the system in an inconsistent state if only one of the two succeeded (see the sketch after this list).

  • Because Kafka Streams uses state stores, and state stores are backed by a Kafka topic, saving data to a state store and then sending a message to another service can be wrapped in a single transaction. This property turns out to be particularly useful.
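To make the second point concrete, here is a minimal sketch of an atomic, multi-topic write using Kafka's transactional producer API. The topic names, keys, values, and the transactional.id are illustrative, not taken from the text:

import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.Producer;
import org.apache.kafka.clients.producer.ProducerConfig;
import org.apache.kafka.clients.producer.ProducerRecord;
import org.apache.kafka.common.KafkaException;
import org.apache.kafka.common.errors.AuthorizationException;
import org.apache.kafka.common.errors.OutOfOrderSequenceException;
import org.apache.kafka.common.errors.ProducerFencedException;
import org.apache.kafka.common.serialization.StringSerializer;

import java.util.Properties;

public class AtomicOrderWriter {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");  // assumed broker address
        props.put(ProducerConfig.TRANSACTIONAL_ID_CONFIG, "order-service-1");  // stable id; a restarted instance fences its predecessor
        props.put(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG, StringSerializer.class.getName());
        props.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG, StringSerializer.class.getName());

        Producer<String, String> producer = new KafkaProducer<>(props);
        producer.initTransactions();  // registers the transactional.id with the broker

        try {
            producer.beginTransaction();
            // Both sends become visible together, or not at all.
            producer.send(new ProducerRecord<>("order-confirmed", "order-123", "CONFIRMED"));
            producer.send(new ProducerRecord<>("stock-decrement", "product-42", "-1"));
            producer.commitTransaction();
        } catch (ProducerFencedException | OutOfOrderSequenceException | AuthorizationException e) {
            // Fatal: another instance took over, or the producer is misconfigured. Close and give up.
            producer.close();
            throw e;
        } catch (KafkaException e) {
            // Recoverable: abort so that downstream consumers never observe a partial write.
            producer.abortTransaction();
        }
        producer.close();
    }
}

A consumer configured with isolation.level=read_committed will see either both messages or neither, which is what prevents the inconsistent state described above.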

In this chapter we delve into transactions, looking at the problems they solve, how we should make use of them, and how they actually work under the covers.

The Duplicates Problem

Any service-based architecture is itself a distributed system, a field renowned for being difficult, particularly when things go wrong. Thought experiments like the Two Generals’ Problem and proofs like the FLP impossibility result highlight these inherent difficulties. But in practice the problem seems less complex. If you make a call to a service ...
