3 min read
After an incident, MTTR is the wrong thing to brag about
Mean time to recovery is easy to measure and easy to game. What to track after an incident instead, so your postmortems actually change something.
Posts tagged
Product updates, incident management notes, and lessons from building Vigiles.
Alert fatigue is a trust problem, not a volume problem. Why false positives erode your team, and how confirming failures across locations fixes it.