book

Modern System Administration

Name: Modern System Administration
Author: Jennifer Davis
ISBN: 9781492055211

by Jennifer Davis

November 2022

Intermediate to advanced

325 pages

8h 13m

English

O'Reilly Media, Inc.

Read now

Unlock full access

Foreword
Preface
Who Should Read This Book?What This Book Is NotScope of This BookIf I Could Tell You Only One ThingIf I Could Tell You Only One More ThingConventions Used in This BookO’Reilly Online LearningHow to Contact UsAcknowledgments
Introducing Modern System Administration
Map Your JourneyEmbrace a Mindset ShiftWhat Is the Job?Flavors of System AdministrationEmbrace Evolving PracticesEmbrace CollaborationEmbrace SustainabilityWrapping Up
I. Reasoning About Systems
1. Patterns and Interconnections
How to Connect ThingsHow Things CommunicateApplication LayerTransport LayerNetwork LayerData Link LayerPhysical LayerWrapping Up
2. Computing Environments
Common WorkloadsChoosing the Location of Your WorkloadsOn-PremCloud ComputingCompute OptionsServerlessContainersVirtual MachinesGuidelines for Choosing ComputeWrapping Up
3. Storage
Why Care About Storage?Key CharacteristicsStorage CategoriesBlock StorageFile StorageObject StorageDatabase StorageConsiderations for Your Storage StrategyAnticipate Your Capacity and Latency RequirementsRetain Your Data as Long as Is Reasonably NecessaryRespect the Privacy Concerns of Your UsersDefend Your DataBe Prepared to Handle Disaster Recovery SituationsWrapping Up
4. Network
Caring About NetworksKey Characteristics of NetworksBuild a NetworkVirtualizationSoftware-Defined NetworksContent Distribution NetworksGuidelines to Your Network StrategyWrapping Up
II. Practices
5. Sysadmin Toolkit
What Is Your Digital Toolkit?The Components of Your ToolkitChoosing an EditorChoosing Programming LanguagesFrameworks and LibrariesOther Helpful UtilitiesWrapping Up

6. Version Control
What Is Version Control?Benefits of Version ControlOrganizing Infra ProjectsWrapping Up
7. Testing
You’re Already TestingCommon Types of TestingLintingUnit TestsIntegration TestsEnd-to-End TestsExplicit Testing StrategyImproving Your Tests; Learning from FailureNext StepsWrapping Up
8. Infrastructure Security
What Is Infrastructure Security?Share Security ResponsibilitiesBorrow the Attacker LensDesign for Security OperabilityCategorize Discovered IssuesWrapping Up
9. Documentation
Know Your AudienceDimensions of DocumentationOrganization PracticesOrganizing a TopicOrganizing a SiteRecommendations for Quality DocumentationWrapping Up
10. Presentations
Know Your AudienceChoose Your ChannelChoose Your Story TypeStorytelling in PracticeCase #1: Charts Are Worth a Thousand WordsCase #2: Telling the Same Story with a Different AudienceThe Key TakeawaysKnow Your VisualsVisual CuesChart TypesRecommended Visualization PracticesWrapping Up
III. Assembling the System
11. Scripting Infrastructure
Why Script Your Infrastructure?Three Lenses to Model Your InfrastructureCode to Build Machine ImagesCode to Provision InfrastructureCode to Configure InfrastructureGetting StartedWrapping Up
12. Managing Your Infrastructure
Infrastructure as CodeTreating Your Infrastructure as DataGetting Started with Infrastructure ManagementLintingWriting Unit TestsWriting Integration TestsWriting End-to-End TestsWrapping Up
13. Securing Your Infrastructure
Assessing Attack VectorsManage Identity and AccessHow Should You Control Access to Your System?Who Should Have Access to Your System?Manage SecretsPassword Managers and Secret Management SoftwareDefending Secrets and Monitoring UsageSecuring Your Computing EnvironmentSecuring Your NetworkSecurity Recommendations for Your Infrastructure ManagementWrapping Up
IV. Monitoring the System
14. Monitoring Theory
Why Monitor?How Do Monitoring and Observability Differ?Monitoring Building BlocksEventsMonitorsData: Metrics, Logs, and TracingFirst-Level MonitoringEvent DetectionData CollectionData ReductionData AnalysisData PresentationSecond-Level MonitoringWrapping Up
15. Compute and Software Monitoring in Practice
Identify Your Desired OutputsWhat Should You Monitor?Do What You Can NowMonitors That MatterPlan for a Monitoring ProjectWhat Alerts Should You Set?Examine Monitoring PlatformsChoose a Monitoring Tool or PlatformWrapping Up
16. Managing Monitoring Data
What Is Monitoring Data?MetricsLogsStructured LogsTracingDistributed TracingChoose Your Data TypesRetain Log DataAnalyze Log DataMonitoring Data at ScaleWrapping Up
17. Monitor Your Work
Why Should You Monitor Your Work?Manage Your Work with KanbanChoose a PlatformFind the Interesting InformationWrapping Up
V. Scaling the System
18. Capacity Management
What Is Capacity?The Capacity Management ModelResource ProcurementJustificationManagementMonitoringThe Framework for Capacity PlanningDo You Need Capacity Planning with Cloud Computing?Wrapping Up
19. Developing On-Call Resilience
What Is On-Call?Humane On-Call ProcessesCheck Your On-Call PoliciesPreparing for On-CallOne Week OutThe Night BeforeYour On-Call RotationOn-Call HandoffThe Day After On-CallMonitor the On-Call ExperienceWrapping Up
20. Managing Incidents
What Is an Incident?What Is Incident Management?Planning and Preparing for IncidentsSet Up and Document Communication ChannelsTrain for Effective CommunicationCreate TemplatesMaintain DocumentationDocument the RisksPractice FailureUnderstand Your ToolsClearly Define Roles and ResponsibilitiesUnderstand Severity Levels and Escalation ProtocolsResponding to IncidentsLearning from the IncidentHow Deep Should You Dig?Aiding DiscoveryDocumenting Incidents EffectivelyDistributing the InformationNext StepsWrapping Up
21. Leading Sustainable Teams
Collective LeadershipAdopt a Whole-Team ApproachBuild Resilient On-Call TeamsUpdate On-Call ProcessesMonitor the Team’s WorkWhy Monitor the Team?What Should You Monitor?Measure Impact on the TeamSupport Team Infrastructure with DocumentationBudget a Learning CultureAdapt to ChallengesWrapping Up
Conclusion
A. Protocols in Practice
Hypertext Transfer ProtocolQUICDomain Name System
B. Resolving Test Failures
Test Failure Type #1: Environment ProblemsTest Failure Type #2: Flawed Test LogicTest Failure Type #3: Changing AssumptionsTest Failure Type #4: Flaky TestsTest Failure Type #5: Code Defects
Index
About the Author

Overview

Early system administration required in-depth knowledge of a variety of services on individual systems. Now, the job is increasingly complex and different from one company to the next with an ever-growing list of technologies and third-party services to integrate. How does any one individual stay relevant in systems and services? This practical guide helps anyone in operations—sysadmins, automation engineers, IT professionals, and site reliability engineers—understand the essential concepts of the role today.

Collaboration, automation, and the evolution of systems change the fundamentals of operations work. No matter where you are in your journey, this book provides you the information to craft your path to advancing essential system administration skills. Author Jennifer Davis provides examples of modern practices and tools with recommended materials to advance your skills.

Topics include:

Development and testing: Version control, fundamentals of virtualization and containers, testing, and architecture review
Deploying and configuring services: Infrastructure management, networks, security, storage, serverless, and release management
Scaling administration: Monitoring and observability, capacity planning, log management and analysis, and security and compliance

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9781492055204Errata Page

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills