Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Beyond Human: AI-Powered Network Operations for the Enterprise

AI doesn’t replace teams. It frees them. AI can be viewed as a digital twin, shouldering the manual load, eliminating low-value work and giving people their time back. In network operations, where every second counts and pressure never lets up, AI becomes the way to rise above the pressing workload. The overwhelming workload isn’t due to teams being incapable, but more because they’re buried in busywork.

Introducing Live Call Routing for Incident Response

Today, we are introducing Live Call Routing, a direct phone line that connects incoming calls to on-call engineers. It captures human-reported incidents that monitoring tools might miss—closing the loop between automated alerts and real-world observations so nothing falls through the cracks. It helps you respond to critical incidents faster by eliminating manual call routing, reducing response times from minutes to seconds.

Live Call Routing - Getting started

Live Call Routing is a direct line that connects incoming calls to on-call engineers. It captures human-reported incidents that monitoring tools might miss—closing the loop between automated alerts and real-world observations so nothing falls through the cracks. It helps you respond to critical incidents faster by eliminating manual call routing, reducing response times from minutes to seconds.

RAISE AI Summit with PagerDuty's Jennifer Tejada and Spotify's Tyson Singer | July 2025

Hear PagerDuty CEO & Chairperson Jennifer Tejada and Spotify’s Tyson Singer, VP of Technology and Platforms on the topic of “Never Miss a Beat: Building Reliable Experiences with AI” at the RAISE AI Summit in Paris on July 9, 2025.

Demo Roundups! Meet the PagerDuty AI Agents

Welcome to the future of operations, where people and agents manage critical work together, driving productivity and efficiency. Learn how PagerDuty’s AI agents can supercharge teams, by autonomously handling repetitive tasks and resolving well-known issues, while surfacing data and insights that augment human expertise for faster resolution and higher operational resilience.

How to Strengthen Your Security Operations with Incident Response Software

When our organization – a mid-sized, fast-scaling technology company specializing in enterprise service management solutions, serving clients in regulated industries like finance and healthcare – faced its first serious cybersecurity breach in early 2024, we realized our incident response management approach wasn’t just outdated – it was putting the business at risk. Back then, we had alerts. We had logs.

Beyond Outages: The Post-Incident Reviews We Should Have Had

In the past year alone, we’ve seen just how much a single outage can disrupt and how much stronger teams become when they learn from it. From the July 16, 2024 incident to the widespread June 2025 outage, it’s clear that incidents are inevitable. The question is: how do you transform each disruption into an opportunity to improve your processes for the next one?