Incident Management

The latest news and information on incident management, on-call, incident response, and related technologies.

7 key processes for running a top performing NOC

Jun 14, 2021 By Eyal Katz In Exigence

Cloud computing and digital and SaaS applications fuel much of today’s business. When something goes wrong with them, the impact is severe: productivity, customer satisfaction, and even loyalty take a hit, and costs mount for resolving the incident, remediating damage, and getting back to business.


Complete Guide to Service Level Objectives (SLOs) That Work

Jun 11, 2021 By Noor-ul-Anam Ruqayya In Blameless

Wondering what Service Level Objectives (SLOs) are? In this article, we explain service level objectives and how they relate to SLAs, SLIs, and error budgets. A Service Level Objective (SLO) is a reliability target that is measured by a Service Level Indicator (SLI) and sometimes serves as a safeguard for a Service Level Agreement (SLA). SLOs represent customer happiness and guide the development team’s velocity.
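
As a rough illustration of how an SLI, an SLO, and an error budget relate, here is a minimal Python sketch. It is not taken from the linked article: the function names, the request counts, and the 99.9% target are all assumptions chosen for the example, and it treats the SLI as a simple ratio of good events to total events.

```python
def availability_sli(good_events: int, total_events: int) -> float:
    """SLI as the fraction of events that were good (assumed definition)."""
    if total_events == 0:
        return 1.0  # no traffic, so nothing failed
    return good_events / total_events


def error_budget_remaining(sli: float, slo_target: float) -> float:
    """Share of the error budget still unspent; negative means overspent."""
    budget = 1.0 - slo_target  # allowed unreliability under the SLO
    if budget <= 0:
        raise ValueError("an SLO target of 100% leaves no error budget")
    spent = 1.0 - sli  # unreliability actually observed
    return 1.0 - spent / budget


# Hypothetical numbers: 9,995 good requests out of 10,000, SLO of 99.9%.
sli = availability_sli(9_995, 10_000)
remaining = error_budget_remaining(sli, slo_target=0.999)
print(f"SLI={sli:.4f}, error budget remaining={remaining:.0%}")
```

With these made-up numbers the service has used about half of its budget, which is the kind of signal a team might use to decide how fast to ship.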


Here's what SLIs AREN'T

Jun 10, 2021 By Emily Arnott In Blameless

SLIs, or service level indicators, are powerful metrics of service health. They’re often built up from simpler metrics collected from the system. SLIs transform lower-level machine data into something that captures user happiness. Your organization might already have processes with this same goal. Techniques like real-time telemetry and synthetic data also build metrics that meaningfully represent service health.


BigPanda and xMatters Can Do What??? - xMatters Demo

Jun 10, 2021 By xMatters In xMatters

Have you ever dealt with two or more separate incidents, but something about them seems suspiciously similar? Well, BigPanda and xMatters might just be the toolset you need to start connecting the dots.


The MTTR that matters

Jun 10, 2021 By Robert Ross In FireHydrant

“Mean time to X” is a common term used to describe how long, on average, a particular milestone takes to achieve in incident response. There’s mean time to detect, acknowledge, mitigate, etc. And then there’s the elusive “mean time to recover,” also known as “MTTR.” MTTR, a hotly debated acronym and concept, measures how long it takes to resolve an incident on average. The problem with MTTR, though, is that it doesn’t matter.
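
To make the arithmetic behind a “mean time to X” figure concrete, here is a minimal Python sketch. It is only an illustration under stated assumptions: the incident records, the field names (`started`, `acknowledged`, `resolved`), and the helper `mean_minutes_between` are hypothetical and do not come from the linked post or from any particular tool.

```python
from datetime import datetime
from statistics import mean


def mean_minutes_between(incidents, start_field, end_field):
    """Average minutes from start_field to end_field across incidents."""
    durations = [
        (inc[end_field] - inc[start_field]).total_seconds() / 60
        for inc in incidents
        # skip incidents that have not reached the milestone yet
        if inc.get(start_field) and inc.get(end_field)
    ]
    if not durations:
        raise ValueError("no incidents have both timestamps")
    return mean(durations)


# Hypothetical incident log.
incidents = [
    {"started": datetime(2021, 6, 1, 10, 0),
     "acknowledged": datetime(2021, 6, 1, 10, 5),
     "resolved": datetime(2021, 6, 1, 10, 30)},
    {"started": datetime(2021, 6, 2, 14, 0),
     "acknowledged": datetime(2021, 6, 2, 14, 15),
     "resolved": datetime(2021, 6, 2, 15, 30)},
]

mtta = mean_minutes_between(incidents, "started", "acknowledged")
mttr = mean_minutes_between(incidents, "started", "resolved")
print(f"mean time to acknowledge: {mtta} min, mean time to resolve: {mttr} min")
```

Note that the two sample incidents took 30 and 90 minutes to resolve, so the single averaged number hides a threefold difference between them.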


Press Release: iLert achieves Amazon RDS Ready designation

Jun 9, 2021 By iLert In iLert

Cologne, Germany – iLert GmbH, a SaaS company for alerting, on-call management, and uptime monitoring, announced today that it has achieved the Amazon RDS Ready designation, part of the Amazon Web Services, Inc. (AWS) Service Ready Program. This designation recognizes that iLert has demonstrated successful integration with Amazon Relational Database Service (Amazon RDS).


Faster Incident Resolution with Context Rich Alerts

Jun 9, 2021 By Roshan Shetty In Squadcast

Labelling your alert payloads, although simple, can significantly reduce the time it takes your team to respond to incidents. In this blog, learn how Squadcast's auto-tagging feature can be a game changer by enabling intelligent labelling and routing of alerts to ultimately reduce your MTTR. A frequent problem faced by on-call engineers when critical outages occur is pinpointing the exact point of failure.


AIOps as a modern cockpit, and why that matters

Jun 9, 2021 By BigPanda In BigPanda

Join us in a CTO Perspective discussion with Jason Walker, Chief Customer Officer at BigPanda and former marine pilot, to find out exactly how IT Ops is following in the footsteps of the modern cockpit and why that should matter to anyone looking to adopt AIOps.


AIOps as a modern cockpit, and why that matters

Jun 9, 2021 By Yoram Pollack In BigPanda

Our human capacity for ingesting information and acting on it is constant. As the systems we operate grow more complex, we need to make sure we use technology that presents us with only the relevant information we need, exactly when we need it. In aviation, this lesson was learned long ago, and now IT Ops is catching up.


5 Steps to Building an Effective Clinical Communication Plan

Jun 8, 2021 By Christopher Gonzalez In OnPage

Organizations require a well-crafted clinical communication plan to streamline workflows across care teams. The communication plan must include processes, hardware, and software that improve how providers perform. An effective communication plan eliminates barriers across departments and ensures that all providers are informed of patient-related incidents. High-level healthcare administrators are responsible for designing, managing, and launching the clinical communication plan.
