Book description
Data is at the center of many challenges in system design today. Difficult issues such as scalability, consistency, reliability, efficiency, and maintainability need to be figured out. In addition, we have an overwhelming variety of tools, including relational databases, NoSQL datastores, stream or batch processors, and message brokers. What are the right choices for your application? How do you make sense of all these buzzwords?
In this practical and comprehensive guide, author Martin Kleppmann helps you navigate this diverse landscape by examining the pros and cons of various technologies for processing and storing data. Software keeps changing, but the fundamental principles remain the same. With this book, software engineers and architects will learn how to apply those ideas in practice, and how to make full use of data in modern applications.
- Peer under the hood of the systems you already use, and learn how to use and operate them more effectively
- Make informed decisions by identifying the strengths and weaknesses of different tools
- Navigate the trade-offs around consistency, scalability, fault tolerance, and complexity
- Understand the distributed systems research upon which modern databases are built
- Peek behind the scenes of major online services, and learn from their architectures
Table of contents
- Preface
- I. Foundations of Data Systems
  - 1. Reliable, Scalable, and Maintainable Applications
  - 2. Data Models and Query Languages
  - 3. Storage and Retrieval
  - 4. Encoding and Evolution
- II. Distributed Data
  - 5. Replication
  - 6. Partitioning
  - 7. Transactions
  - 8. The Trouble with Distributed Systems
  - 9. Consistency and Consensus
- III. Derived Data
  - 10. Batch Processing
  - 11. Stream Processing
  - 12. The Future of Data Systems
- Glossary
- Index
Product information
- Title: Designing Data-Intensive Applications
- Author(s): Martin Kleppmann
- Release date: March 2017
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781491903100