Chapter 10. Using Service Level Objectives for Reliability
This chapter explains why and how to use Service Level Objectives (SLOs) to ensure reliability in observable systems. You will learn about the common problems created by traditional monitoring approaches, how those problems are exacerbated in distributed systems, and how an SLO-based approach to monitoring can solve them. In the next chapter, we’ll examine why observability makes for actionable alerts when using an SLO-based approach.
In this chapter, we take an introductory look at SLO-based alerting, show how observability is used to respond to those alerts, and walk through a real-world use case of shifting engineering culture to adopt SLOs for monitoring.
Introduction to Service Level Objectives
Service Level Objectives (SLOs) are internal goals for measuring service health. Popularized by the Google SRE book, SLOs are a key part of setting external service level agreements between service providers and their customers. ...
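To make the idea of an internal goal for service health concrete, here is a minimal Python sketch. It assumes an SLO can be expressed as a target fraction of "good" events over a measurement window. The `SLO` class, the `compliance` and `is_met` helpers, the service name, and all of the numbers are hypothetical and chosen only for illustration. They are not taken from any particular tool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SLO:
    """A hypothetical internal goal: the fraction of events that must be good."""
    name: str
    target: float  # e.g. 0.999 means 99.9% of events should be good


def compliance(good_events: int, total_events: int) -> float:
    """Return the fraction of events in the window that were good."""
    if total_events == 0:
        # Assumption: a window with no traffic counts as fully compliant.
        return 1.0
    return good_events / total_events


def is_met(slo: SLO, good_events: int, total_events: int) -> bool:
    """Return True when measured compliance is at or above the SLO target."""
    return compliance(good_events, total_events) >= slo.target


# Illustrative numbers only, not real measurements.
checkout_slo = SLO(name="checkout-availability", target=0.999)
print(is_met(checkout_slo, good_events=99_950, total_events=100_000))
print(is_met(checkout_slo, good_events=99_800, total_events=100_000))
```

With these made-up counts, the first window measures 99.95% good events and meets the 99.9% target, while the second measures 99.8% and misses it.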