Chapter 1. The Three Phases of Observability: An Outcomes-Focused Approach

The cloud native ecosystem has changed how people around the world work. It allows us to build scalable, resilient, and novel software architectures with idiomatic backend systems by using the power of the open source ecosystem and open governance.

How does it do that? Distributed architectures. The introduction of containers made the cloud flexible and empowered distributed systems. However, the ever-changing nature of these systems can cause them to fail in a multitude of ways. Distributed systems are inherently complex, and, as systems theorist Richard Cook notes, “Complex systems are intrinsically hazardous systems.”1

Think about how many different hazards a container faces: it can be terminated, it can run out of memory, it can fail its readiness probe, or its pod can be evicted from a restarting node, to name a few. These additional complexities are a trade-off for highly flexible, scalable, and resilient distributed architectures.

Distributed systems have many more moving parts. The constant struggle for high availability means that, more than ever, we need observability: the ability to understand changes within a system.

Thanks in large part to Cindy Sridharan’s concept of “three pillars of observability,” introduced in her groundbreaking work Distributed Systems Observability,2 many people think that if you have logs, traces, and metrics (Figure 1-1), you have observability. Let’s look quickly at each of these:

Logs

Logs describe discrete events and transactions within a system. They consist of timestamped messages generated by your application that, read together, tell a story about what’s happening.

Metrics

Metrics consist of time-series data that describes a measurement of resource utilization or behavior. They are useful because they provide insights into the behavior and health of a system, especially when aggregated.

Traces

Traces use unique IDs to track down individual requests as they hop from one service to another. They can show you how a request travels from one end to the other.

Indeed, as Sridharan makes clear, these are powerful tools that, if understood well, can unlock the ability to build better systems.

Figure 1-1. The three pillars of observability3
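To make these definitions concrete, here is a minimal, invented sketch of what each signal might look like for a hypothetical checkout service. Every field name and value is an illustrative assumption, not output from any particular tool:

    import time
    import uuid

    # A log: a discrete, timestamped message describing an event in the system.
    log_event = {
        "timestamp": time.time(),
        "level": "ERROR",
        "service": "checkout",                     # hypothetical service name
        "message": "payment authorization failed",
        "order_id": "order-1234",                  # invented example value
    }

    # A metric: one sample in a time series measuring behavior or utilization.
    metric_sample = {
        "name": "payment_requests_failed_total",   # hypothetical counter name
        "labels": {"service": "checkout", "region": "us-east-1"},
        "timestamp": time.time(),
        "value": 17,                               # cumulative failures so far
    }

    # A trace: spans sharing one trace ID, so a single request can be followed
    # as it hops from service to service.
    trace_id = uuid.uuid4().hex
    spans = [
        {"trace_id": trace_id, "span_id": uuid.uuid4().hex, "service": "checkout",
         "operation": "POST /checkout", "duration_ms": 240},
        {"trace_id": trace_id, "span_id": uuid.uuid4().hex, "service": "payments",
         "operation": "authorize_card", "duration_ms": 180},
    ]

In practice these shapes come from a logging library, a metrics client, and a tracing SDK rather than hand-built dictionaries; the point is only that logs are discrete events, metrics are numeric time series, and traces are correlated spans.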

However, as Rob Skillington pointed out at 2021’s SREcon,4 simply adding more data (and more types of data) won’t necessarily make observability more effective. After all, adding more data can easily create more noise and disorganization. Uber, he notes, initially used Graphite successfully with tens of microservices but found that it did not scale up to handle hundreds or thousands of microservices.

Martin Mao, along with Skillington, solved Uber’s scaling problem by building M3, Uber’s large-scale metrics platform. Mao points out that increasing your volume of logs, metrics, and traces does not guarantee a better outcome either. Metrics, like logs and traces, are simply inputs to observability; having all three does not necessarily lead to better observability, or even to proper observability at all. Thus, in our opinion, the data itself is the wrong thing to focus on.

If the three pillars of observability don’t in themselves constitute observability, then how do we measure observability? In our view, one of the most impactful ways is to see how well your observability system helps you remediate an issue within the system efficiently. Our approach shifts the focus from what kind of data you have to what kind of outcomes you want to strive for. This is an outcomes-focused approach.

But let’s take another step back and ask why we even want observability at all. What do we want to do with all this data we’re producing? It’s for a single, unchanging purpose: to remediate or prevent issues in the system.

As builders of that system, we tend to measure what we know best: we ask what kinds of metrics we should produce in order to tell whether something is wrong with the system and remediate it. Working backward from customer outcomes instead lets us focus on where the heart of observability should be: What is the best experience for the customer?

In most cases, the customer (whether they are external or internal) wants to be able to do what they came to do: for example, buy the products they are looking for. They cannot do that if the payment processor isn’t working. We can work backward from there: we don’t want our customers to be unable to buy products, so if the payment processor goes down or becomes degraded, we want to know as soon as possible so we can remediate that issue. To do that, we need to ensure that we can detect payment processor downtime quickly, then triage to make sure we know the impact and the root cause, all while looking for opportunities to rapidly remediate, stopping the customer’s pain.

Once you identify the outcomes you are looking for, the signals (metrics, logs, and traces) can play their role. If your customers need error-free payment processing, you can craft a way to measure and troubleshoot exactly that. When deciding on signals, then, we endorse starting from the outcomes you want.
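For instance, a sketch of such a measurement (the function names, counts, and 1% threshold are all illustrative assumptions) might compute the payment error rate over a window and alert only when the customer outcome is at risk:

    # Minimal sketch: turn raw payment request counts into an outcome-focused
    # signal (error rate) and an alert decision. Names and threshold are invented.

    ERROR_RATE_THRESHOLD = 0.01  # assumption: alert if more than 1% of payments fail

    def payment_error_rate(failed: int, total: int) -> float:
        """Fraction of payment requests that failed in the current window."""
        return 0.0 if total == 0 else failed / total

    def should_alert(failed: int, total: int) -> bool:
        """Fire only when the customer outcome (payments succeed) is threatened."""
        return payment_error_rate(failed, total) > ERROR_RATE_THRESHOLD

    # Example window: 120 failures out of 5,000 attempts -> 2.4% -> alert.
    print(should_alert(failed=120, total=5000))  # True

The same rate could be computed from metrics, derived from logs, or sampled from traces; the signal is chosen to match the outcome, not the other way around.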

In response to Sridharan, we call our approach the three phases of observability (Figure 1-2).

Figure 1-2. The three phases of observability5

As part of a remediation process, the three phases can be described in the following terms:

  1. Knowing quickly within the team if something is wrong

  2. Triaging the issue to understand the impact: identifying the urgency of the issues and deciding which ones to prioritize

  3. Understanding and fixing the underlying problem after performing a root cause analysis

Some systems are easier to observe than others. The key is understanding the system in question.

Let’s say you work for an ecommerce platform. It’s the annual Black Friday sale, and millions of people are logged in simultaneously. Here’s how the three phases of observability might play out for you:

Phase 1: Knowing

Suddenly, multiple alerts fire off to notify you of failures. You now know that requests are failing.

Phase 2: Triaging

Next, you triage the alerts to learn which failures are most urgent, identify which teams you need to coordinate with, and determine whether there is any customer impact. You scale up the infrastructure serving those requests and remediate the issue.

Phase 3: Understanding

Later on, you and your team perform a postmortem investigation of the issue. You learn that one of the components in the payments processor system is scanning multiple users and causing CPU cycles to increase tenfold—far more than necessary. You determine that this increase was the root cause of the incident. You and the team proceed to fix the component permanently.
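To make the triage in phase 2 concrete, here is a minimal sketch of ranking the firing alerts by customer impact so the most urgent failure is handled first; the alert fields, severity weights, and numbers are illustrative assumptions:

    # Sketch of phase 2 (triaging): rank firing alerts by customer impact.
    # The alert structure, severity weights, and counts are illustrative only.

    SEVERITY_WEIGHT = {"critical": 3, "warning": 2, "info": 1}

    firing_alerts = [
        {"name": "HighPaymentErrorRate",  "severity": "critical", "customers_affected": 40000},
        {"name": "ElevatedSearchLatency", "severity": "warning",  "customers_affected": 5000},
        {"name": "BatchReportJobDelayed", "severity": "info",     "customers_affected": 0},
    ]

    def impact_score(alert: dict) -> int:
        """Crude priority: severity weight scaled by how many customers are affected."""
        return SEVERITY_WEIGHT[alert["severity"]] * (1 + alert["customers_affected"])

    for alert in sorted(firing_alerts, key=impact_score, reverse=True):
        print(f"{alert['name']}: score {impact_score(alert)}")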

In this example, you resolved an issue using observability, even though you didn’t use all three signals. Looking only at the metrics dashboard, you determined which systems were causing the issue and guided the infrastructure team in fixing it.

Just like in mathematics, there are multiple ways to arrive at the correct answer; the important thing is to do so quickly and efficiently. If you can remediate a problem by relying only on your previous knowledge of the system, without using metrics, logs, and traces, that is still a good outcome. You remediated the problem, and that’s the real goal. And, of course, this is made easier with correct signals that are outcomes-based and can quickly validate any remediation assumptions!

Remediating at Any Phase

Although we posit three phases, at any phase your goal is always to remediate problems. If a single alert is firing and you can remediate the issue with that initial visibility alone (phase 1), you should do so. You don’t have to triage or perform a root cause analysis every time if they are unnecessary.

To illustrate this point, let’s say a scheduled deployment breaks your production environment. There is no need to triage or do root cause analysis here, since you already know that the deployment caused the breakage. Simply rolling back the deployment when errors become visible remediates the issue.

The Three Phases Illustrated

In real life, if your system is crashing, you don’t focus on the data. You focus on fixing the problem immediately. No one does a root cause analysis before fixing the current issue and mitigating customers’ pain.

Take, for example, a burning house (Figure 1-3).

Figure 1-3. Remediating a burning house: you should put out the fire before you start investigating the cause

If your house is on fire, how do you know? Most likely, your smoke alarm goes off, emitting a loud, unmistakable noise that notifies you of the problem. That smoke alarm is the alert, triggered by sensors detecting smoke in the room. Metrics can tell you what the issue is and give you enough information to address it. This is surface-level detection but enough to continue investigating. That’s phase 1. Metrics should give you a low mean time to detect (MTTD), so a sensitive fire alarm that goes off at the first sign of smoke or heat will be better than one that lets the fire spread for several minutes before notifying you—and better still than no alarm at all.

What now? You might jump out of bed and look around the house to see where the fire is, then get everyone out immediately and call emergency services. That’s a temporary remediation (phase 2): you’re all safe, but the house is still on fire. It’s also triaging: you are choosing to prioritize safety over other things, like saving your favorite electronics.

The sooner you can call emergency services, the faster they will arrive to put the fire out. In observability, we call this interval mean time to remediate (MTTR). This, too, should be as low as possible: if the firefighters arrive quickly and start hosing down the house right away, part of the house could be saved. If anyone is injured, you’ll want the paramedics to arrive quickly to help them: that is, you want a low MTTR.
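Because MTTD and MTTR both boil down to elapsed time, a minimal sketch of computing them across incidents might look like the following; the timestamps are invented, and some teams measure MTTR from when the issue began rather than from detection:

    from datetime import datetime, timedelta
    from statistics import mean

    # Invented incident timeline: (issue started, issue detected, issue remediated).
    incidents = [
        (datetime(2023, 11, 24, 9, 0),  datetime(2023, 11, 24, 9, 4),  datetime(2023, 11, 24, 9, 30)),
        (datetime(2023, 11, 24, 14, 0), datetime(2023, 11, 24, 14, 1), datetime(2023, 11, 24, 14, 12)),
    ]

    def minutes(delta: timedelta) -> float:
        return delta.total_seconds() / 60

    # MTTD: average time from the issue starting until an alert fires.
    mttd = mean(minutes(detected - started) for started, detected, _ in incidents)

    # MTTR (mean time to remediate, as this chapter uses it): average time from
    # detection until the customer-facing pain stops.
    mttr = mean(minutes(remediated - detected) for _, detected, remediated in incidents)

    print(f"MTTD: {mttd:.1f} min, MTTR: {mttr:.1f} min")  # MTTD: 2.5 min, MTTR: 18.5 min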

The next morning, with everyone safe and the last embers extinguished, the fire marshal and insurance investigator examine the house to see what started the fire (a root cause analysis, phase 3). Perhaps they learn that a faulty cord on an appliance overheated. The appliance manufacturer might even recall the product to ensure that the faulty cords don’t start any more fires, a still more permanent remediation that keeps even more people safe.

No one does an investigation during an active fire, because there is still a threat of injury. Similarly, the worst time to do a deep dive on exactly how a system misbehaves is during an ongoing outage.

You do phases 1 and 2 immediately, before you try to figure out where the fire started or why. You focus on the outcome of keeping everyone safe. One way to fulfill that is to use metrics as your starting point: in this case, smoke in the room is the metric, and once you smell smoke, you automatically evacuate the house.

1 Richard Cook, “How Complex Systems Fail,” Cognitive Technologies Laboratory, 2000, https://oreil.ly/zw73j.

2 Cindy Sridharan, Distributed Systems Observability (O’Reilly Media, 2018), https://oreil.ly/v8PUu.

3 Sridharan, Distributed Systems Observability.

4 Rob Skillington, “SREcon21—Taking Control of Metrics Growth and Cardinality: Tips for Maximizing Your Observability,” USENIX, October 14, 2021, YouTube video, 27:21, https://oreil.ly/gvAq7.

5 Adapted from an image in Rachel Dines, “Explain It Like I’m 5: The Three Phases of Observability,” Chronosphere, August 10, 2021, https://chronosphere.io/learn/explain-it-like-im-5-the-three-phases-of-observability.
