Skip to Content
Site Reliability Engineering Fundamentals
on-demand course

Site Reliability Engineering Fundamentals

with Emil Stolarsky, Jaime Woo
December 2021
Intermediate
1h 48m
English
O'Reilly Media, Inc.
Closed Captioning available in German, English, Spanish, French, Italian, Japanese, Korean, Portuguese (Portugal, Brazil), Chinese (Simplified), Chinese (Traditional)

Overview

Over the past five years, the ideas behind site reliability engineering (SRE) have caught fire because of their success in improving the reliability of systems. But those just starting their SRE journey often have questions. For instance, how do you transform an existing organization toward SRE? Where do DevOps and SRE overlap, and where do they diverge? And which method for calculating and measuring service-level objectives (SLOs) should you use, and when?

Join Incident Labs’ Emil Stolarsky and Jaime Woo to gain a foundational understanding of SRE principles and the infrastructure practices and processes of a range of organizations—along with actionable advice on putting them to work in your organization. Emil and Jaime will also take you through the pragmatic and sometimes messy decisions that must be made on a regular basis to form a functional and successful SRE culture.

Make meaningful changes to how you run your services immediately, and learn how to start meaningfully participating in the SRE community.

What you’ll learn and how you can apply it

By the end of this recording of a live online course, you’ll understand:

  • What SRE is (and isn’t) and how it’s evolved over the past decade
  • How SRE relates to concepts like DevOps and resilience engineering
  • The benefits of SRE
  • When and how SRE should be applied for maximum impact
  • Current SRE conversations and where they’re happening

And you’ll be able to:

  • Assess how SRE is implemented across various companies of different sizes
  • Implement foundational SRE concepts, such as SLOs and error budgets
  • Debunk common myths and misunderstandings around SRE
  • Evaluate the progress of SRE adoption and strategies and relate them back to stakeholders
This live event is for you because…
  • You’re a developer new to or looking to enter an SRE role.
  • You build the tools that improve deployment, shepherd code from developers into production, make sure it keeps running, or anything else remotely related.
  • You want to become well-versed in the foundations and best practices of SRE.

Prerequisites

  • Experience running software in production environments
  • Familiarity with the struggle of implementing SRE

Recommended follow-up:

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Site Reliability Engineering

Site Reliability Engineering

Niall Richard Murphy, Betsy Beyer, Chris Jones, Jennifer Petoff
Site Reliability Engineering: How Google Runs Production Systems

Site Reliability Engineering: How Google Runs Production Systems

Betsy Beyer, Chris Jones, Jennifer Petoff, Niall Richard Murphy
Fundamentals of Data Engineering

Fundamentals of Data Engineering

Joe Reis, Matt Housley
Fundamentals of Data Engineering

Fundamentals of Data Engineering

Joe Reis, Matt Housley

Publisher Resources

ISBN: 0636920668534