book

Designing Data-Intensive Applications, 2nd Edition

by Martin Kleppmann, Chris Riccomini

February 2026

Intermediate to advanced

650 pages

22h 2m

English

O'Reilly Media, Inc.

Read now

Unlock full access

Who Should Read This Book?What’s New in the Second Edition?References and Further ReadingConventions Used in This BookO’Reilly Online LearningHow to Contact UsAcknowledgments
Operational Versus Analytical SystemsCharacterizing Transaction Processing and AnalyticsData WarehousingSystems of Record and Derived DataCloud Versus Self-HostingPros and Cons of Cloud ServicesCloud-Native System ArchitectureOperations in the Cloud EraDistributed Versus Single-Node SystemsProblems with Distributed SystemsMicroservices and ServerlessCloud Computing Versus SupercomputingData Systems, Law, and SocietySummary
Case Study: Social Network Home TimelinesRepresenting Users, Posts, and FollowsMaterializing and Updating TimelinesDescribing PerformanceLatency and Response TimeAverage, Median, and PercentilesUse of Response Time MetricsReliability and Fault ToleranceFault ToleranceHardware and Software FaultsHumans and ReliabilityScalabilityDescribing LoadShared-Memory, Shared-Disk, and Shared-Nothing ArchitecturePrinciples for ScalabilityMaintainabilityOperability: Making Life Easy for OperationsSimplicity: Managing ComplexityEvolvability: Making Change EasySummary
Relational Model versus Document ModelThe Object-Relational MismatchNormalization, Denormalization, and JoinsMany-to-One and Many-to-Many RelationshipsStars and Snowflakes: Schemas for AnalyticsWhen to Use Which ModelGraph-Like Data ModelsProperty GraphsThe Cypher Query LanguageGraph Queries in SQLTriple-Stores and SPARQLDatalog: Recursive Relational QueriesGraphQLEvent Sourcing and CQRSDataFrames, Matrices, and ArraysSummary
Storage and Indexing for OLTPLog-Structured StorageB-TreesComparing B-Trees and LSM-TreesMulti-Column and Secondary IndexesKeeping everything in memoryData Storage for AnalyticsCloud Data WarehousesColumn-Oriented StorageQuery Execution: Compilation and VectorizationMaterialized Views and Data CubesMultidimensional and Full-Text IndexesFull-Text SearchVector EmbeddingsSummary
Formats for Encoding DataLanguage-Specific FormatsJSON, XML, and Binary VariantsProtocol BuffersAvroThe Merits of SchemasModes of DataflowDataflow Through DatabasesDataflow Through Services: REST and RPCDurable Execution and WorkflowsEvent-Driven ArchitecturesSummary
Single-Leader ReplicationSynchronous Versus Asynchronous ReplicationSetting Up New FollowersHandling Node OutagesImplementation of Replication LogsProblems with Replication LagReading Your Own WritesMonotonic ReadsConsistent Prefix ReadsSolutions for Replication LagMulti-Leader ReplicationGeographically Distributed OperationSync Engines and Local-First SoftwareDealing with Conflicting WritesCRDTs and Operational TransformationLeaderless ReplicationWriting to the Database When a Node Is DownLimitations of Quorum ConsistencySingle-Leader Versus Leaderless Replication PerformanceDetecting Concurrent WritesSummary
Pros and Cons of ShardingSharding for MultitenancySharding of Key-Value DataSharding by Key RangeSharding by Hash of KeySkewed Workloads and Relieving Hot SpotsOperations: Automatic or Manual RebalancingRequest RoutingSharding and Secondary IndexesLocal Secondary IndexesGlobal Secondary IndexesSummary
What Exactly Is a Transaction?The Meaning of ACIDSingle-Object and Multi-Object OperationsWeak Isolation LevelsRead CommittedSnapshot Isolation and Repeatable ReadPreventing Lost UpdatesWrite Skew and PhantomsSerializabilityActual Serial ExecutionTwo-Phase Locking (2PL)Serializable Snapshot Isolation (SSI)Distributed TransactionsTwo-Phase Commit (2PC)Distributed Transactions Across Different SystemsDatabase-internal Distributed TransactionsSummary
Faults and Partial FailuresUnreliable NetworksThe Limitations of TCPNetwork Faults in PracticeDetecting FaultsTimeouts and Unbounded DelaysSynchronous Versus Asynchronous NetworksUnreliable ClocksMonotonic Versus Time-of-Day ClocksClock Synchronization and AccuracyRelying on Synchronized ClocksProcess PausesKnowledge, Truth, and LiesThe Majority RulesDistributed Locks and LeasesByzantine FaultsSystem Model and RealityFormal Methods and Randomized TestingSummary

LinearizabilityWhat Makes a System Linearizable?Relying on LinearizabilityImplementing Linearizable SystemsThe Cost of LinearizabilityID Generators and Logical ClocksLogical ClocksLinearizable ID GeneratorsConsensusThe Many Faces of ConsensusConsensus in PracticeCoordination ServicesSummary
Batch Processing with Unix ToolsSimple Log AnalysisChain of Commands Versus Custom ProgramSorting Versus In-memory AggregationBatch Processing in Distributed SystemsDistributed FilesystemsObject StoresDistributed Job OrchestrationBatch Processing ModelsMapReduceDataflow EnginesShuffling DataQuery languagesDataFramesBatch Use CasesExtract–Transform–Load (ETL)AnalyticsMachine LearningServing Derived DataSummary
Transmitting Event StreamsMessaging SystemsLog-based Message BrokersDatabases and StreamsKeeping Systems in SyncChange Data CaptureState, Streams, and ImmutabilityProcessing StreamsUses of Stream ProcessingReasoning About TimeStream JoinsFault ToleranceSummary
Data IntegrationCombining Specialized Tools by Deriving DataBatch and Stream ProcessingUnbundling DatabasesComposing Data Storage TechnologiesDesigning Applications Around DataflowObserving Derived StateAiming for CorrectnessThe End-to-End Argument for DatabasesEnforcing ConstraintsTimeliness and IntegrityTrust, but VerifySummary
Predictive AnalyticsBias and DiscriminationResponsibility and AccountabilityFeedback LoopsPrivacy and TrackingSurveillanceConsent and Freedom of ChoicePrivacy and Use of DataData as Assets and PowerRemembering the Industrial RevolutionLegislation and Self-RegulationSummary

Content preview from Designing Data-Intensive Applications, 2nd Edition

Chapter 1. Trade-offs in Data Systems Architecture

There are no solutions, there are only trade-offs. […] But you try to get the best trade-off you can get, and that’s all you can hope for.

Thomas Sowell, Interview with Fred Barnes (2005)

Data is central to much application development today. With web and mobile apps, software as a service (SaaS), and cloud services, it has become normal to store data from many different users in a shared server-based data infrastructure. Data from user activity, business transactions, devices and sensors needs to be stored and made available for analysis. As users interact with an application, they both read the data that is stored, and also generate more data.

Small amounts of data, which can be stored and processed on a single machine, are often fairly easy to deal with. However, as the data volume or the rate of queries grows, it needs to be distributed across multiple machines, which introduces many challenges. As the needs of the application become more complex, it ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Start your free trial

Publisher Resources

ISBN: 9781098119058Errata Page

Designing Data-Intensive Applications, 2nd Edition

by Martin Kleppmann, Chris Riccomini

Chapter 1. Trade-offs in Data Systems Architecture

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

You might also like