September 2023

How incident.io enables the confidence to declare more incidents

Sep 29, 2023 By Incident.io In Incident.io

In this snippet, Alon Levi, VP of Engineering at WorkOS, talks about how his team has gained the confidence to declare more incidents with incident.io.

View Video

Incident.io

Incident Management

Read more about How incident.io enables the confidence to declare more incidents

How incident.io's intuitive UI is great for non-technical responders

Sep 29, 2023 By Incident.io In Incident.io

In this snippet, Alon Levi, VP of Engineering at WorkOS, talks about how non-technical responders have been able to confidently declare incidents thanks to incident.io's intuitive UI.

View Video

Incident.io

Incident Management

Read more about How incident.io's intuitive UI is great for non-technical responders

How WorkOS has benefitted from incident.io's high-quality support

Sep 29, 2023 By Incident.io In Incident.io

In this snippet, Alon Levi, VP of Engineering at WorkOS, talks about the quality of support his team has received from incident.io.

View Video

Incident.io

Incident Management

Read more about How WorkOS has benefitted from incident.io's high-quality support

How incident.io helped WorkOS transform its incident response featuring VP of Engineering, Alon Levi

Sep 29, 2023 By Incident.io In Incident.io

View Video

Incident.io

Incident Management

Read more about How incident.io helped WorkOS transform its incident response featuring VP of Engineering, Alon Levi

Better learning from incidents: A guide to incident post-mortem documents

Sep 27, 2023 By Luis Gonzalez In Incident.io

If you’re just starting out in the world of incident response, then you’ve probably come across the phrase “post-mortem” at least once or twice. And if you’re a seasoned incident responder, the phrase probably invokes mixed feelings. Just to clarify, here, we’re talking about post-mortem documents, not meetings. It’s a distinction we have to make since lots of teams use the phrase to refer to the meeting they have after an incident.

Read Post

Incident.io

Read more about Better learning from incidents: A guide to incident post-mortem documents

Clouds, caches and connection conundrums

Sep 26, 2023 By Ben Wheatley In Incident.io

We recently moved our infrastructure fully into Google Cloud. Most things went very smoothly, but there was one issue we came across last week that just wouldn’t stop cropping up. What follows is a tale of rabbit holes, red herrings, table flips and (eventually) a very satisfying smoking gun. Grab a cuppa, and strap in. Our journey starts, fittingly, with an incident getting declared... 💥🚨

Read Post

Incident.io

Read more about Clouds, caches and connection conundrums

incident.io workflows and integrations - as told by Pleo

Sep 23, 2023 By Incident.io In Incident.io

View Video

Incident.io

Incident Management

Read more about incident.io workflows and integrations - as told by Pleo

How we've made Status Pages better over the last three months

Sep 22, 2023 By Asiya Gorelik In Incident.io

A few months ago we announced Status Pages – the most delightful way to keep customers up-to-date about ongoing incidents. We built them because we realized that there was a disconnect between what customers needed to know about incidents, and how easily accessible this information was. For example: As we built them, we focused on designing a solution that powered crystal-clear communication, without the overhead — all beautifully integrated into incident.io.

Read Post

Incident.io

Read more about How we've made Status Pages better over the last three months

How incident io thinks about learning from incidents

Sep 21, 2023 By Incident.io In Incident.io

A overview of how incident.io thinks about incidents, and how they promote learning in a smaller organisation.

View Video

Incident.io

Incident Management

Read more about How incident io thinks about learning from incidents

The struggles of actually applying incident theory

Sep 21, 2023 By Incident.io In Incident.io

Chris explains his thoughts on the theory of learning from incidents, and why work needs to be done to close the gap and help folks actually trying to get their job done.

View Video

Incident.io

Incident Management

Read more about The struggles of actually applying incident theory

What's wrong with MTTR?

Sep 21, 2023 By Incident.io In Incident.io

Taken from our a full debrief on "Learning from incidents is not the goal", Chris walks through MTTR, the justifiable bad rap it has, and his thoughts on it as a measure.

View Video

Incident.io

Incident Management

Read more about What's wrong with MTTR?

Active and passive learning from incidents

Sep 21, 2023 By Incident.io In Incident.io

In this video, Chris shares his thoughts on the difference between active learning: writing and sharing debriefs, meeting to walk through an incident, etc., and passive learning: running incidents in the open, dynamic collaboration, reviewing past incidents.

View Video

Incident.io

Incident Management

Read more about Active and passive learning from incidents

The Debrief: Learning from incidents is not the goal

Sep 21, 2023 By Incident.io In Incident.io

In this video, incident.io co-founder and CPO Chris Evans walks through his blog post "Learning from incidents is not the goal". We cover why he wrote this, his thoughts on the gap between theory and practice, and how people can really learn from incidents.

View Video

Incident.io

Incident Management

Read more about The Debrief: Learning from incidents is not the goal

The balancing act of reliability and availability

Sep 19, 2023 By incident.io In Incident.io

As consumers, we expect the products and software we buy to work 100% of the time. Unfortunately, that’s impossible. Even the most reliable products and services experience some disruption in service. Crashes, bugs, timeouts. There are a ton of contributing factors, so it's impossible to distill disruptions down to a single cause. That said, technology is becoming more and more sophisticated, and so is the infrastructure that supports it.

Read Post

Incident.io

Read more about The balancing act of reliability and availability

The connection between incident management and problem management

Sep 15, 2023 By Luis Gonzalez In Incident.io

Sometimes, two concepts overlap so much that it’s hard to view them in isolation. Today, incident management and problem management fit this description to a tee. This wasn’t always the case. For a long time, these two ITIL concepts were seen as distinct—with specialized roles overseeing each. Incident management existed in one corner and problem management in the other. Then came the DevOps movement and the lines suddenly became blurred. So where do they stand today?

Read Post

Incident.io

Read more about The connection between incident management and problem management

Practical guidance for getting started as a site reliability engineer

Sep 8, 2023 By Ben Wheatley In Incident.io

At the beginning of May, I joined incident.io as the first site reliability engineer (SRE), a very exciting but slightly daunting move. With only some high-level knowledge of what the company and its systems looked like prior to this point, it’s fair to say that I didn’t have much certainty in what exactly I’d be working on or how I’d deliver it.

Read Post

Incident.io

Read more about Practical guidance for getting started as a site reliability engineer

Operations | Monitoring | ITSM | DevOps | Cloud

September 2023

How incident.io enables the confidence to declare more incidents

How incident.io's intuitive UI is great for non-technical responders

How WorkOS has benefitted from incident.io's high-quality support

How incident.io helped WorkOS transform its incident response featuring VP of Engineering, Alon Levi

Better learning from incidents: A guide to incident post-mortem documents

Clouds, caches and connection conundrums

incident.io workflows and integrations - as told by Pleo

How we've made Status Pages better over the last three months

How incident io thinks about learning from incidents

The struggles of actually applying incident theory

What's wrong with MTTR?

Active and passive learning from incidents

The Debrief: Learning from incidents is not the goal

The balancing act of reliability and availability

The connection between incident management and problem management

Practical guidance for getting started as a site reliability engineer

Monthly Archive

Follow Us