Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Choosing SLOs that users need, not the ones you want to provide

Oct 1, 2020 By Squadcast In Squadcast

In our latest two-part blog series, Adam Hammond talks about how you can build sustainable SLOs that are appropriate for your users, your technology platform, and your business, which in turn will help you make your systems robust, your customers happy, and your business boom.

Efficient Incident Management with Catchpoint and PagerDuty

Oct 1, 2020 By Kameerath Kareem In Catchpoint

The ability to detect performance issues and alert on them quickly is key to reducing the Mean Time to Resolve (MTTR). Proactive monitoring will catch incidents early on, but triggering the right alerts and notifying the relevant incident management team is just as critical. Enterprises rely on multiple disparate tools to monitor different systems, so a lot of data and noise is generated, which can render incident management inefficient.

Refreshing PagerDuty's Navigation for Increased Efficiency and Simplification

Oct 1, 2020 By Ruta Srinivasaraju In PagerDuty

We are super excited to share that we are currently testing and in the process of rolling out a new desktop global navigation to all of our users. Things that are clear in retrospect often emerge from ambiguous and humble beginnings. Initially built as a simple on-call management tool for IT responders, PagerDuty has evolved into an end-to-end, enterprise-grade digital operations platform.

New release: Incident Automation just got even better with conditions in FireHydrant Runbooks

Oct 1, 2020 By Dylan Nielsen In FireHydrant

The ability to automate your incident response process means you can start responding to incidents faster. So it’s easy to see why FireHydrant Runbooks is so popular within the platform. When you let automation take over, you can focus more on fixing problems and keeping your customers happy. Now with the addition of conditions, you can create even more powerful automation.

How to: Email Incident Stakeholders with conditions in FireHydrant

Oct 1, 2020 By Rich Burroughs In FireHydrant

Our release of conditions in FireHydrant Runbooks has made it easier for teams who rely on email to communicate with key stakeholders or a distribution list. 💡If your team uses Slack, and you haven’t already installed our Slack integration, you should definitely check it out as it’s the easiest way to automate updates to channels when the status of an incident changes.

The Ultimate, Free Incident Retrospective Template

Sep 30, 2020 By Hannah Culver In Blameless

Incident retrospectives (or postmortems, post-incident reports, RCAs, etc.) are the most important part of an incident. This is where you take the gift of that experience and turn it into knowledge. This knowledge then feeds back into the product, improving reliability and ensuring that no incident is a wasted learning opportunity. Every incident is an unplanned investment and teams should strive to make the most of it.

5 Tips For Better On-Call Support (in 2020)

Sep 30, 2020 By AlertOps In AlertOps

Your enterprise needs on-call support, but it often struggles to achieve its desired results. Yet the longer your enterprise waits to improve its on-call support processes and procedures, the greater the risk becomes that a minor outage could cause substantial downtime. Ultimately, your enterprise needs seamless on-call support processes and procedures.

FireHydrant demo at Chaos Conf 2020

Sep 29, 2020 By FireHydrant In FireHydrant

FireHydrant CEO, Robert Ross, demos the FireHydrant platform during Chaos Conf 2020.

A Closer Look at PagerDuty's New AIOps Capabilities

Sep 29, 2020 By Ariel Russo In PagerDuty

Another PagerDuty Summit is in the books, and we’re still coming down from the excitement and energy our customers and community showed us over the past week. We made several big announcements over the course of the conference, but none more significant than the AIOps advancements on our digital operations platform. We introduced a number of ways customers can apply machine learning algorithms and automation to a wide range of workflows across the platform.

Any PLC alarm on your mobile device

Sep 25, 2020 By Derdack In Derdack

Maintenance of machines is an incredibly important task, and it is important to fix a machine before it completely fails. In reactive maintenance scenarios, speed of response is key. Once an issue is detected, it is important to communicate it as reliably and quickly as possible to the right engineer. Ideally, the machine is connected directly to the team of mobile engineers in charge and can let them know exactly what happened and what needs to be fixed.
