Chapter 8. Server Monitoring
With many monitoring efforts beginning in the sysadmin/ops engineer team, itâs no wonder that many of us immediately associate âmonitoringâ with âthe thing the sysadmins do.â This is unfortunate since weâve seen thereâs so much more to monitoring than just what happens on a server.
Of course, thereâs an element of truth in the misconception: a lot really does happen on the server! Even in a serverless architecture, there are still servers underneath that provide the platform and all that makes it tick. Weâre going to delve down into what sort of common services youâll encounter on servers these days, what metrics and logs are provided, and how to make sense of it all.
One note before we jump in: this chapter is going to use Linux as the assumed operating system, since thatâs what Iâm most familiar with. For the readers applying these lessons to Windows, nearly all of the stuff weâll be covering is just as applicable to Windows in a general sense, though your tools are different.
Standard OS Metrics
Over the course of this book, Iâve railed against the obsession with the standard OS metrics (CPU, memory, load, network, disk) and for good reason: starting your monitoring work with them is starting with the metrics that offer the least signal of all toward your main concern (that your app is working). In order to know if things are working, you have to start at the top instead, which I covered xref.
However, that isnât to ...
Get Practical Monitoring now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.