Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Three Teams That Can Use AIOps to Work Smarter, Not Harder

There isn’t a boardroom today that isn’t asking how AI and generative AI applications can help drive efficiency and accelerate their business. For organizations looking to capitalize on ML and automation to improve their efficiency during incidents, AIOps is a tangible, proven application and an exciting opportunity for ITOps teams. As we’ve seen across market landscape evaluations, there are a number of ways that solutions can be implemented.

A Practical Guide to Incident Communication

Even the best software fails sometimes. How quickly those failures get addressed, and how your teammates and customers feel about you after the fact, come down to how well you communicate with them. Users, customer success managers, Ops team members, IT, security, engineering leadership, even the executive team: each has a vested interest in resolving engineering incidents quickly, and all need to be updated with the right information at the right time.

How to use Key-Based Deduplication in Squadcast | Deduplication Rules | Squadcast

Key Based Deduplication is an efficient way to avoid duplicate entries when processing incoming Events alongside existing Incidents. It generates a Deduplication Key using a user-defined template specific to events from an Alert Source. This key helps identify and group duplicates. This video explains how Key Based Deduplication works and how to set it up effectively.
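
The general idea of key-based deduplication can be sketched in a few lines of Python. This is only an illustration of the technique, not Squadcast's implementation: the `${host}:${check}` template syntax, the event field names, and the `dedup_key` and `group_events` helpers are all assumptions made up for this example.

```python
from string import Template


def dedup_key(template: str, event: dict) -> str:
    """Render a deduplication key from a user-defined template.

    Assumption: the template uses Python's ${field} placeholder syntax,
    which is not necessarily what any real alerting tool uses.
    """
    return Template(template).substitute(event)


def group_events(template: str, events: list[dict]) -> dict[str, list[dict]]:
    """Group incoming events under one incident per deduplication key."""
    incidents: dict[str, list[dict]] = {}
    for event in events:
        key = dedup_key(template, event)
        # Events that render to the same key are treated as duplicates
        # and attached to the same incident.
        incidents.setdefault(key, []).append(event)
    return incidents


# Hypothetical events from a single alert source.
events = [
    {"host": "db-1", "check": "disk", "message": "disk 91% full"},
    {"host": "db-1", "check": "disk", "message": "disk 93% full"},
    {"host": "web-2", "check": "cpu", "message": "cpu high"},
]

incidents = group_events("${host}:${check}", events)
for key, grouped in incidents.items():
    print(key, len(grouped))
```

Here the two disk alerts from `db-1` render to the same key and collapse into one incident, while the `web-2` alert gets its own. Choosing which fields go into the template is the main design decision: too few fields merge unrelated alerts, too many let duplicates through.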

Helm Dry Run: Guide & Best Practices

Kubernetes, the de facto standard for container orchestration, supports two deployment options: imperative and declarative. Because they are more conducive to automation, declarative deployments are typically considered better than imperative ones. The issue with the declarative approach is that YAML manifest files are static.
Sponsored Post

Managing On-Call Rotations: Navigating Incident Management from Chaos to Calm

Navigating On-Call rotations can often feel like taming a storm of alerts and constant disruptions, leaving teams overwhelmed and stressed. Hence there is a need to streamline On-Call rotations and leverage the right software to restore order and peace. In this guide, you'll explore practical tips, best practices, and smart strategies to transform your Incident Management process. Let's get to a more efficient On-Call experience.

The Unplanned Show, Episode 10: Mitra Goswami on Generative AI

In this episode, Mitra shares valuable insights on how to successfully adopt generative AI, from selecting use cases that deliver value and having foundational data infrastructure in place to setting design and privacy guidelines. Grab a paper and pen and take some notes!

Demo Roundup: What's new in the PagerDuty Operations Cloud, August 2023

Customer-impacting issues are detected and reported by customers anywhere from 20% to 90%+ of the time! In this episode of our quarterly demo roundup, we'll see how to quickly take action on a customer-reported issue, with the help of #GenerativeAI and more great new capabilities in the PagerDuty Operations Cloud. Six of PagerDuty’s product managers give live demos.