The Site Reliability Workbook

by Betsy Beyer, Niall Richard Murphy, David K. Rensin, Kent Kawahara, Stephen Thorne

Released July 2018

Publisher(s): O'Reilly Media, Inc.

ISBN: 9781492029502

Book description

In 2016, Google's Site Reliability Engineering book ignited an industry discussion on what it means to run production services today and why reliability considerations are fundamental to service design. Now, Google engineers who worked on that bestseller introduce The Site Reliability Workbook, a hands-on companion that uses concrete examples to show you how to put SRE principles and practices to work in your environment.

This new workbook not only combines practical examples from Google's experiences, but also provides case studies from Google's Cloud Platform customers who underwent this journey. Evernote, The Home Depot, The New York Times, and other companies outline hard-won experiences of what worked for them and what didn't.

Dive into this workbook and learn how to flesh out your own SRE practice, no matter what size your company is.

You'll learn:

How to run reliable services in environments you don't completely control, like cloud
Practical applications of how to create, monitor, and run your services via Service Level Objectives
How to convert existing ops teams to SRE, including how to dig out of operational overload
Methods for starting SRE from either greenfield or brownfield
