Service-level indicators
In Chapter 1, Monitoring Fundamentals, we introduced the notion of What to measure, discussing Google's Four Golden Signals, as well as the USE and RED methodologies. Building upon that knowledge, we can start to define service-level indicators (SLIs), which reflect a given service's performance and availability. Constructing queries to generate SLIs is a common pattern of PromQL usage and one of the most useful.
Let's look at an example. An SLI is typically defined as the number of good events divided by the number of valid events. In this case, we want to know what percentage of the requests served by Prometheus complete in 100 ms or less, which makes it a latency SLI. First we need to gather information ...
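To make the good-over-valid definition concrete, here is a minimal Python sketch of the arithmetic behind a latency SLI. It is not the query the chapter goes on to build. It assumes latency data shaped like a cumulative histogram, where each upper bound in seconds maps to the count of requests at or below that bound. The function name `latency_sli` and the sample counts are hypothetical and exist only for illustration.

```python
def latency_sli(bucket_counts, threshold_s):
    """Return the fraction of requests that completed within threshold_s.

    bucket_counts maps an upper bound in seconds to the cumulative number
    of requests observed at or below that bound. The float("inf") key
    holds the total number of valid requests.
    """
    total = bucket_counts[float("inf")]
    if total == 0:
        return None  # no valid events, so the SLI is undefined
    good = bucket_counts[threshold_s]  # requests at or below the threshold
    return good / total


# Hypothetical sample data: 950 of 1000 requests finished within 100 ms.
buckets = {0.1: 950, 0.2: 980, 0.4: 995, float("inf"): 1000}

sli = latency_sli(buckets, 0.1)
print(f"{sli:.1%}")  # prints 95.0%
```

Returning `None` when there are no valid events avoids a division by zero and keeps "no traffic" distinct from "0% good".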