Skip to Content
可观测性工程
book

可观测性工程

by Charity Majors, Liz Fong-Jones, George Miranda
July 2023
Beginner to intermediate
270 pages
4h 48m
Chinese
China Machine Press
Content preview from 可观测性工程
使用
SLO
来提高可靠性
|
115
建立传统的告警机制来监控用户体验,意味着系统工程师必须选择任意的常数来预测何
时体验不佳。例如,这些告警可能会在“
10
个用户经历了缓慢的页面加载时间”或“第
95
百分位数的请求持续超过一定的毫秒”时触发。在一个基于指标的方法中,系统工程
师需要划分出哪些确切的静态措施表明正在发生不可接受的问题。
然而,由于用户在访问过程中可能会在不同的地区以不同形式接入,所以导致一天中系
统的性能变化是非常大的。在流量拥堵期间,当你可能有数百个并发会话时,
10
个用户
经历缓慢的页面加载时间可能是一个重要的指标。但是,当你可能有数以万计的并发会
话正在运行时,这种重要性会在负载高峰期急剧下降。
记住,在分布式系统中故障是不可避免的。小的瞬时故障总是在你没有注意到的情况下
发生。常见的例子包括一个失败的请求,后来重试成功;一个关键的进程启动失败了,
直到它被路由到一个新配置的主机上才启动成功;或者一个服务没有响应了,直到它的
请求被路由到一个备份服务上。这些类型的瞬时故障所带来的额外延迟可能会在流量高
峰期融入正常的操作,但在流量低的时期,
p95
响应时间对单个数据点会更加敏感。
同样,这些例子也说明了基于时间的指标的粗糙性。比方说,
p95
的响应时间是以
5
分钟
的间隔来衡量的。每隔
5
分钟,后
5
分钟间隔的性能报告一个值,如果超过了静态阈值,
就会触发告警。如果这个值超过了阈值,整个
5
分钟的间隔就被认为是坏的(反之,任何
没有超过阈值的
5
分钟间隔都被认为是好的)。在这种类型的指标上发出告警,会导致假 ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

可观察性工程

可观察性工程

Charity Majors, Liz Fong-Jones, George Miranda
What Successful Project Managers Do

What Successful Project Managers Do

W. Scott Cameron, Jeffrey S. Russell, Edward J. Hoffman, Alexander Laufer

Publisher Resources

ISBN: 9787111729099