Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Introducing Schedule Rotations: One Schedule, Many Rotations, Total Coverage

When coverage gets complicated, Schedule Rotations keeps it simple. On-call can get real messy, real fast. One minute you’ve got a neat little schedule for the two people rotating primary and secondary. Next thing you know, you’ve got engineers in three time zones, a new hire shadowing incidents, and your “simple” rotation has turned into a board game with no rules. So we fixed it.

Building the Road for Innovation-PagerDuty and AWS in Action

Every organization wants to innovate, but the reality is that operational friction can grind even the most ambitious plans to a halt. A delayed response here, an inactionable alert there, and suddenly your engineers are spending more time firefighting than building. Context is scattered across tools, and the “big picture” is lost in a sea of alerts and thumbnail-sized dashboards that provide no context or direction.

How to Create a Runbook Template That Actually Gets Used

A runbook template is only valuable if your team actually uses it during incidents. Yet many organizations create elaborate documentation that sits untouched in wikis, gathering digital dust while engineers scramble through incidents without guidance. The difference between a runbook that gets used and one that doesn't comes down to practicality, accessibility, and continuous improvement. Let's explore how to create runbook templates that become essential tools rather than checkbox exercises.

9 Best IT Alerting Software in 2025 (Plus 3 Open-Source Options)

I’ve curated a list of 9 best IT alerting software and 3 open-source alternatives for you. Every tool on this list handles the core alerting functions you need: incident detection, fast alert delivery, clear escalation paths, and reliable incident logging. Since all these tools tick those boxes, I focused on what makes each tool special. You’ll find their unique features under “Standout Alerting Features of ” for each option.

Is WhatsApp Safe for Healthcare Communication? Here's What Hospitals in UAE, Israel, and Saudi Are Realizing

At HIMSS this year, in between flashy AI demos and interoperability debates, I kept hearing the same concern from hospital leaders across the UAE, Saudi Arabia, and Israel: “We’re still using WhatsApp for clinical messaging—but it’s starting to feel risky.” Some shared stories of messages getting missed. Others brought up concerns around data privacy and compliance.

Mass Notifications for Local Government: Keeping Residents Informed During Emergencies

When unexpected risks disrupt the health and safety of the public, fast, reliable mass notification systems for local governments are essential. Without them, residents miss critical alerts that protect public health. For example, imagine a scenario like this: A water main break occurs in Waltham at 6:13 am, it took the public works team less than ten minutes to assess the damage and determine that the water is not safe to drink. However, most residents didn’t find out until hours later.

Zoom Video Communications Uses PagerDuty to Keep Video Conferencing Frictionless for Every Customer

Zoom Video Communications is a video conferencing company on a mission to make video communications frictionless for all. Eric Yuan, CEO and founder of Zoom, and Alex Guerrero, Senior Manager of SaaS Operations, dive into why their teams have adopted PagerDuty as their end-to-end incident management platform. Companies trust Zoom for their video conferencing services and, according to Yuan, “Our business counts on PagerDuty.”

Mistakes To Avoid With Your Public Status Page

A public status page forms the public face of your organization's service availability. It is the first point of contact for your customers to check the status of your services during times of crisis. Hence, ensuring the credibility and uptime of your public status page is crucial to your organization's reputation. In this article we will look at the key mistakes to avoid while hosting and managing a public status page.