·Ankit Mehta·3 min read

How to communicate during an outage without making it worse

When your service is down, your customers are not only frustrated that it broke. They are frustrated that they cannot tell whether you know. Silence reads as either you have not noticed or you do not care, and both are worse than the outage itself.

The instinct during an incident is to say nothing until you have answers. Hold the update, look more in control. It is the wrong instinct. A short, honest note that you are aware and looking into it buys more goodwill than a perfect explanation an hour later.

Say something before you know everything

You do not need a root cause to post an update. You need to confirm three things. You know there is a problem. You are working on it. You will say more soon.

That is enough to stop the support queue from filling with the same question, and it changes the tone of the whole event. A customer who can see you working will wait. A customer staring at a page that says everything is fine while nothing works will not.

Post early, even when the message is thin. You can add detail as you learn it.

Write for the person who is blocked

During an outage, your update is being read by someone who cannot do their job right now. They do not want reassurance and they do not want jargon. They want to know what is affected, whether their part is affected, and roughly when to check back.

Name the impact in plain terms. Say which service is degraded and what that means for them. Skip the internal detail about which pod restarted. Give them a reason to stop refreshing, like a time you will post the next update, and then keep that promise.

Close it out properly

The update most teams forget is the last one. The service recovers, everyone exhales, and the status page sits on investigating for two more days.

Mark it resolved, say how long it lasted, and say in one line what happened. You do not owe anyone a full postmortem on the status page, but a short honest recap turns an outage into a sign that you handle problems well. People remember how you communicated far longer than they remember the downtime.

Make it the easy path

None of this happens during an incident if it is extra work. The team is busy fixing the thing. Communication only happens reliably when it is built into the same flow.

That is why status pages in Vigiles run off the same incidents your monitors create. When an incident opens, the page can reflect it. When it resolves, the recovery shows up with the duration already filled in. Customers subscribe and get the updates without anyone copying text between tools. The honest update becomes the easy update, which is the only kind that actually gets sent.

A good outage is one your customers felt informed through. Vigiles ties status pages to your live incidents, so communicating is part of the response, not a separate job. Start free, or see how it all fits together.