Software Reliability Engineering

SREsyphus

Maybe, one of these days, my SRE team can define SLOs that actually make sense. This "alert on whatever the customer wants" stance really, really sucks.

Image description: A flowchart depicting a circular process. "Why didn't you catch that incident?" > Tune alerts to be more sensitive > "Why are our alerts so noisy?" > Tune alerts to be less noisy > and repeat.