Foundations of Scalable Systems

Name: Foundations of Scalable Systems
Author: Ian Gorton
ISBN: 9781098106065

June 2022

Intermediate to advanced

337 pages

9h 23m

English

O'Reilly Media, Inc.

Preface
Why Scalability?; Who This Book Is For; What You Will Learn; Note for Educators; Conventions Used in This Book; Using Code Examples; O’Reilly Online Learning; How to Contact Us; Acknowledgments
I. The Basics
1. Introduction to Scalable Systems
What Is Scalability?; Examples of System Scale in the Early 2000s; How Did We Get Here? A Brief History of System Growth; Scalability Basic Design Principles; Scalability and Costs; Scalability and Architecture Trade-Offs; Performance; Availability; Security; Manageability; Summary and Further Reading
2. Distributed Systems Architectures: An Introduction
Basic System Architecture; Scale Out; Scaling the Database with Caching; Distributing the Database; Multiple Processing Tiers; Increasing Responsiveness; Systems and Hardware Scalability; Summary and Further Reading
3. Distributed Systems Essentials
Communications Basics; Communications Hardware; Communications Software; Remote Method Invocation; Partial Failures; Consensus in Distributed Systems; Time in Distributed Systems; Summary and Further Reading
4. An Overview of Concurrent Systems
Why Concurrency?; Threads; Order of Thread Execution; Problems with Threads; Race Conditions; Deadlocks; Thread States; Thread Coordination; Thread Pools; Barrier Synchronization; Thread-Safe Collections; Summary and Further Reading
II. Scalable Systems
5. Application Services
Service Design; Application Programming Interface (API); Designing Services; State Management; Applications Servers; Horizontal Scaling; Load Balancing; Load Distribution Policies; Health Monitoring; Elasticity; Session Affinity; Summary and Further Reading
6. Distributed Caching
Application Caching; Web Caching; Cache-Control; Expires and Last-Modified; Etag; Summary and Further Reading
7. Asynchronous Messaging
Introduction to Messaging; Messaging Primitives; Message Persistence; Publish–Subscribe; Message Replication; Example: RabbitMQ; Messages, Exchanges, and Queues; Distribution and Concurrency; Data Safety and Performance Trade-offs; Availability and Performance Trade-Offs; Messaging Patterns; Competing Consumers; Exactly-Once Processing; Poison Messages; Summary and Further Reading
8. Serverless Processing Systems
The Attractions of Serverless; Google App Engine; The Basics; GAE Standard Environment; Autoscaling; AWS Lambda; Lambda Function Life Cycle; Execution Considerations; Scalability; Case Study: Balancing Throughput and Costs; Choosing Parameter Values; GAE Autoscaling Parameter Study Design; Results; Summary and Further Reading
9. Microservices
The Movement to Microservices; Monolithic Applications; Breaking Up the Monolith; Deploying Microservices; Principles of Microservices; Resilience in Microservices; Cascading Failures; Bulkhead Pattern; Summary and Further Reading
III. Scalable Distributed Databases
10. Scalable Database Fundamentals
Distributed Databases; Scaling Relational Databases; Scaling Up; Scaling Out: Read Replicas; Scale Out: Partitioning Data; Example: Oracle RAC; The Movement to NoSQL; NoSQL Data Models; Query Languages; Data Distribution; The CAP Theorem; Summary and Further Reading
11. Eventual Consistency
What Is Eventual Consistency?; Inconsistency Window; Read Your Own Writes; Tunable Consistency; Quorum Reads and Writes; Replica Repair; Active Repair; Passive Repair; Handling Conflicts; Last Writer Wins; Version Vectors; Summary and Further Reading
12. Strong Consistency
Introduction to Strong Consistency; Consistency Models; Distributed Transactions; Two-Phase Commit; 2PC Failure Modes; Distributed Consensus Algorithms; Raft; Leader Election; Strong Consistency in Practice; VoltDB; Google Cloud Spanner; Summary and Further Reading
13. Distributed Database Implementations
Redis; Data Model and API; Distribution and Replication; Strengths and Weaknesses; MongoDB; Data Model and API; Distribution and Replication; Strengths and Weaknesses; Amazon DynamoDB; Data Model and API; Distribution and Replication; Strengths and Weaknesses; Summary and Further Reading
IV. Event and Stream Processing
14. Scalable Event-Driven Processing
Event-Driven Architectures; Apache Kafka; Topics; Producers and Consumers; Scalability; Availability; Summary and Further Reading
15. Stream Processing Systems
Introduction to Stream Processing; Stream Processing Platforms; Case Study: Apache Flink; DataStream API; Scalability; Data Safety; Conclusions and Further Reading
16. Final Tips for Success
Automation; Observability; Deployment Platforms; Data Lakes; Further Reading and Conclusions
Index
About the Author

Content preview from Foundations of Scalable Systems

Chapter 1. Introduction to Scalable Systems

The last 20 years have seen unprecedented growth in the size, complexity, and capacity of software systems. This rate of growth is hardly likely to slow in the next 20 years—what future systems will look like is close to unimaginable right now. However, one thing we can guarantee is that more and more software systems will need to be built with constant growth—more requests, more data, and more analysis—as a primary design driver.

Scalable is the term used in software engineering to describe software systems that can accommodate growth. In this chapter I’ll explore what precisely is meant by the ability to scale, known (not surprisingly) as scalability. I’ll also describe a few examples that put hard numbers on the capabilities and characteristics of contemporary applications and give a brief history of the origins of the massive systems we routinely build today. Finally, I’ll describe two general principles for achieving scalability, replication and optimization, which will recur in various forms throughout the rest of this book, and examine the indelible link between scalability and other software architecture quality attributes.

What Is Scalability?

Intuitively, scalability is a pretty straightforward concept. If we ask Wikipedia for a definition, it tells us, “Scalability is the property of a system to handle a growing amount of work by adding resources to the system.” We all know how we scale a highway system—we add more traffic ...

Publisher Resources

ISBN: 9781098106058
Errata Page
