Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on IT Operations Management and related technologies.

PagerDuty Named a Leader and Outperformer in the 2025 GigaOm Radar for AIOps

There’s no shortage of hype around AI in operations, but recognition from a trusted source like GigaOm cuts through the noise. We are excited to share that PagerDuty earned a top spot as a Leader and Outperformer in the 2025 report. It’s recognition that reflects the progress we’ve made in delivering an AI-powered platform that actually helps teams move faster, reduce costs, and operate with confidence in complex environments.

Building the Road for Innovation-PagerDuty and AWS in Action

Every organization wants to innovate, but the reality is that operational friction can grind even the most ambitious plans to a halt. A delayed response here, an inactionable alert there, and suddenly your engineers are spending more time firefighting than building. Context is scattered across tools, and the “big picture” is lost in a sea of alerts and thumbnail-sized dashboards that provide no context or direction.

Beyond Human: AI-Powered Network Operations for the Enterprise

AI doesn’t replace teams. It frees them. AI can be viewed as a digital twin, shouldering the manual load, eliminating low-value work and giving people their time back. In network operations, where every second counts and pressure never lets up, AI becomes the way to rise above the pressing workload. The overwhelming workload isn’t due to teams being incapable, but more because they’re buried in busywork.

Beyond Outages: The Post-Incident Reviews We Should Have Had

In the past year alone, we’ve seen just how much a single outage can disrupt and how much stronger teams become when they learn from it. From the July 16, 2024 incident to the widespread June 2025 outage, it’s clear that incidents are inevitable. The question is: how do you transform each disruption into an opportunity to improve your processes for the next one?

Lessons from the June 12 Outage: Your Operations Are Only as Reliable as Your Incident Management Platform

As digital operations grow increasingly more complex, resilience is no longer optional, it’s essential. The next major outage isn’t a question of if, but when. And when it hits, the gap between true enterprise platforms and brittle point tools will become impossible to ignore.

When the Internet Blinked: What the June 12 Outage Teaches Us About Resilience

On June 12, 2025, the internet blinked. Email vanished, apps froze, and many of us lost contact with our digital coworkers (both AI and human). The world felt it instantly; businesses stalled, teams scrambled, and digital operations everywhere took a hit. Felt a little like deja vu. Does anyone remember July 19, 2024?

PagerDuty Advance and Amazon Q Business announce General Availability of their AI-powered, chat-first integration

When it comes to incident management, the ability to quickly access and act on operational data can mean the difference between brand loyalty and costly downtime. PagerDuty’s integration with the Amazon Q Business index addresses this challenge head-on by providing a seamless, more secure, and faster way to search and access enterprise knowledge across the IT ecosystem.

Engineering Time is Your Most Valuable Asset: Are You Spending It Right?

Technology leaders often face a tempting proposition from their engineering teams: “We could build this ourselves.” It’s a natural instinct, especially when discussing incident management systems. Your team’s confidence isn’t misplaced – they absolutely could build a basic alerting system. However, the question isn’t about capability; it’s about strategic resource allocation and long-term operational excellence.

Beyond Playbooks: Unleashing Enterprise-Wide Automation with Ansible + PagerDuty Runbook Automation

Playbooks are nice. Results are better. This simple truth highlights a critical challenge in modern enterprises: while technical teams have mastered infrastructure automation with Ansible, they need more than just technical playbooks that can only be used by SMEs—they need comprehensive automation that drives measurable business outcomes.