© Eyal Shahar  2019
E. ShaharProject Reliability Engineeringhttps://doi.org/10.1007/978-1-4842-5019-8_8

8. Advanced Logging, Monitoring, and Alerting

Eyal Shahar1 
(1)
San Francisco, CA, USA
 

We’ve made dashboards we can use to observe the machine’s behavior in real time, and we created logs that we can review retroactively and trace the origin of issues, if they arise. However, both of these tools require us to be proactive in order to know whether the machine needs our attention. Eventually, we don’t want to babysit the project and wait for it to fail – instead, we want the machine itself to let us know that our attention is needed. Alerting is what happens when the machine lets you know it needs your intervention. For big companies, that’s a huge ...

Get Project Reliability Engineering: Pro Skills for Next Level Maker Projects now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.