%term

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Making Go errors play nice with Sentry

Apr 6, 2022 By Lawrence Jones In Incident.io

Here at incident.io, we provide a Slack-based incident response tool. The product is powered by a monolithic Go backend service, serving an API that powers Slack interactions, serves an API for our web dashboard, and runs background jobs that help run our customers incidents. Incidents are high-stakes, and we want to know when something has gone wrong. One of the tools we use is Sentry, which is where our Go backend send its errors.

Read Post

Incident.io

Read more about Making Go errors play nice with Sentry

Four Use Cases for Optimizing Your Cloud Operations With PagerDuty Runbook Automation

Apr 6, 2022 By Madeline Stack In PagerDuty

The cloud is easy and powerful—until it’s not. Once companies have customers, commitments, and compliance concerns, they often have to create cloud operations teams to manage the cloud on behalf of their fellow employees. Often, organizations that migrate to the cloud find themselves hampered by inefficient cloud operations if they haven’t standardized their IT procedures for operability.

Read Post

PagerDuty

Read more about Four Use Cases for Optimizing Your Cloud Operations With PagerDuty Runbook Automation

Nobl9 integration with PagerDuty

Apr 6, 2022 By PagerDuty In PagerDuty

"Create alerts with meaning. Proactively notify your on-call site reliability manager through Pager Duty when SLO thresholds are close to being reached or error budgets are burning faster than planned.

View Video

PagerDuty

Read more about Nobl9 integration with PagerDuty

Prefect integration with PagerDuty

Apr 6, 2022 By PagerDuty In PagerDuty

"Supercharge data workflow event notifications with PagerDuty.Prefect’s PagerDuty integration enables Prefect users to leverage PagerDuty’s comprehensive notifications and IT workflow platform to monitor data pipelines so you can be alerted about issues and resolve them quickly.

View Video

PagerDuty

Read more about Prefect integration with PagerDuty

Keep Stakeholders Informed During Major Incidents

Apr 6, 2022 By xMatters In xMatters

During major incidents, it’s crucial that all stakeholders are provided with the status updates they need. Those communications however need to be tailored to what the stakeholder actually needs, and provided in a streamlined format that works best for them. Just like alert fatigue, communication fatigue can be detrimental during an outage or other service reliability issue.

Read Post

xMatters

Read more about Keep Stakeholders Informed During Major Incidents

What BigPanda's recent funding means for our customers

Apr 5, 2022 By Matt Peloso In BigPanda

The effects of BigPanda’s most recent round of funding—amounting to $190 million—will be reverberating throughout the company for years to come. And it’s not just BigPanda employees who have experienced a surge of enthusiasm in the wake of our Unicorn status. Our customers are thrilled at the prospect of more innovation from our team and new products that help them automate and evolve.

Read Post

BigPanda

Read more about What BigPanda's recent funding means for our customers

Incident management best practices: before the incident

Apr 5, 2022 By Robert Ross In FireHydrant

When incidents inevitably occur in your software stack, managing them well could be the difference between losing customers and building trust with them. In this article, we’ll give you and your team some best practices on how to prepare for managing incidents. It’s crucial to define service ownership, a declaration process, and practice all of it. With a little planning now, you'll be able to cut your incident response time drastically.

Read Post

FireHydrant

Read more about Incident management best practices: before the incident

Freshdesk + Squadcast: Enabling Streamlined Incident Response for Enterprises

Apr 5, 2022 By Nir Sharma In Squadcast

Freshdesk is a cloud-based customer service platform used by enterprises that provides a centralized help desk(with the help of support tickets) across multiple channels, including email, phone, chat, and social media. Squadcast is an incident management platform that integrates with major monitoring, ChatOps and project management tools to provide a centralized place for reliability.

Read Post

Squadcast

Read more about Freshdesk + Squadcast: Enabling Streamlined Incident Response for Enterprises

April 2022 Update - Signl categories and duty scheduler improvements

Apr 5, 2022 By René In SIGNL4

With our April update, we ship some great improvements for Signl categories, category-based alerting and duty scheduling. All details are available in this blog article.

Read Post

SIGNL4

Read more about April 2022 Update - Signl categories and duty scheduler improvements

Show character with Blameless Postmortems (part one)

Apr 4, 2022 By Dave Harrison In Raygun

This is Part 1 of a two-part series on Blameless Postmortems. Today, we'll discuss why blameless postmortems are so important and their implications for your team; the second part will go into detail on how to set them up as a process and make them successful. Somebody wise may have once told you that how we handle adversity shows our character. Being able to acknowledge and admit mistakes is the first step towards learning - it's a key part of success both in personal relationships and in large companies.

Read Post