Live online training

Site Reliability Engineering Fundamentals

A practical guide to key concepts and best practices

Emil Stolarsky

Jaime Woo

This webcast took place live on:

April 15, 2020 11:00am PT / 2:00pm ET

Sign up to watch the recording.

First name

Last name

Email Company name Job title Phone Country

All fields are required.

Please read O’Reilly’s privacy policy.

You must have JavaScript and Cookies enabled to access this webcast. Click here for Help.

Over the past five years, the ideas behind site reliability engineering (SRE) have caught fire because of their success in improving the reliability of systems. But those just starting their SRE journey often have questions. For instance, how do you transform an existing organization toward SRE? Where do DevOps and SRE overlap, and where do they diverge? And which method for calculating and measuring service-level objectives (SLOs) should you use, and when?

Join Incident Labs’ Emil Stolarsky and Jaime Woo to gain a foundational understanding of SRE principles and the infrastructure practices and processes of a range of organizations—along with actionable advice on putting them to work in your organization. Emil and Jaime will also take you through the pragmatic and sometimes messy decisions that must be made on a regular basis to form a functional and successful SRE culture.

Make meaningful changes to how you run your services immediately, and learn how to start meaningfully participating in the SRE community.

What you’ll learn—and how you can apply it

By the end of this live online course, you’ll understand:

What SRE is (and isn’t) and how it’s evolved over the past decade
How SRE relates to concepts like DevOps and resilience engineering
The benefits of SRE
When and how SRE should be applied for maximum impact
Current SRE conversations and where they’re happening

And you’ll be able to:

Assess how SRE is implemented across various companies of different sizes
Implement foundational SRE concepts, such as SLOs and error budgets
Debunk common myths and misunderstandings around SRE
Evaluate the progress of SRE adoption and strategies and relate them back to stakeholders

This course is for you becauseâ¦

You’re a developer new to or looking to enter an SRE role.
You build the tools that improve deployment, shepherd code from developers into production, make sure it keeps running, or anything else remotely related.
You want to become well-versed in the foundations and best practices of SRE.

Prerequisites:

Experience running software in production environments
Familiarity with the struggle of implementing SRE

Recommended follow-up:

Read Site Reliability Engineering (book)
Read The Site Reliability Workbook (book)
Read Seeking SRE (book)
Watch Spotlight on Cloud: Reducing the Impact of Service Outages with Generic Mitigations with Jennifer Mace (video)
Read Implementing Service Level Objectives (book)

About your instructor

Emil Stolarsky is a site reliability engineer. Previously, he worked on caching, performance, and disaster recovery at Shopify and the internal Kubernetes platform at DigitalOcean. He’s the program cochair for SREcon EMEA 2019 and SREcon Americas West 2020 and contributed a chapter to the O’Reilly book Seeking SRE.
Jaime Woo is an award-nominated writer and a frequent speaker at SREcon EMEA, Americas West, and Americas East. He began his career as a molecular biologist before working at DigitalOcean, Riot Games, and Shopify, where he launched the engineering communications function.

Course outline

SRE fundamentals (55 minutes)

Presentation: Overview of site reliability engineering; What does SRE look like in practice?
Group discussion: Where on the SRE journey is your company?; understanding reliability as a feature
Q&A

Break (5 minutes)

SRE best practices (60 minutes)

Presentation: SLIs, SLOs, SLAs, and error budgets; building a learning organization; What does it take to roll out SRE?
Hands-on exercise: Unlock key SLIs and corresponding SLOs
Q&A

Site Reliability Engineering Fundamentals

Sign up to watch the recording.

Thank you for your registration.

What you’ll learn—and how you can apply it

About your instructor

Course outline