March 2021
Intermediate to advanced
560 pages
9h 55m
English
In this chapter, we’ll cover
• Setting up your logging infrastructure in GCP
• Creating metrics and alerts
• Monitoring your applications for performance, uptime, and overall health
Reliability is the best metric for retaining customers. Knowing this, Google spun up Site Reliability Engineering (SRE), a philosophy similar to DevOps (and oftentimes referred to as a subset or sibling of DevOps), that focuses on leveraging aspects of software engineering and applying them to infrastructure and operations problems.
Even today in most traditional on-premises environments, operations management is typically handled by an IT operations team in charge of infrastructure provisioning, capacity management, cost control, ...
Read now
Unlock full access