Chapter 45. Using ChatOps to Implement Empathy
Daniella Niyonkuru
“Janet is handling an incident lasting over an hour. Another available commander should offer to hand off or override the rest of their shift so Janet can take a break!”
This addition to our ChatOps incident response reminder sequence sounded so simple at first, yet it wasn’t long after that I observed how impactful it was to SREs who were on call. The two-sentence prompt led available commanders to reach out right away and offer to help—and this gesture seemed to reinvigorate the on-call commander.
That’s when I realized that ChatOps, beyond automating and offering an easy interface to manage incident response and infrastructure, also has the power to make our work more sustainable.
SRE teams are often overworked due to the nature of their work. The focus can be so much on systems that the SREs behind them are forgotten. Dealing with too much toil, having night shifts, and constantly being the first line of defense against outages can take a toll on SREs and consequently the systems they work on. In Chapter 44, I discuss how implementing empathy is important and can alleviate the burden, but given that teams are also resource- and time-limited, how best to implement it?
We look to a trusted tool in the SRE toolkit: automation. After doing the difficult work of creating the boundaries, limits, and budgets ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access