Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

New features: Event flows, revamped alert view, sleek reports, and much more

As you know, we've introduced a major update in recent months – ilert Responder – the AI Agent that helps you run root cause analysis during incidents and provides recommendations toward faster resolution. That's not all, and there are way more powerful features to share with you. Feel free to reach out to us via chat or at support@ilert.com if you have questions or if you want to propose a feature or improvement.

FireHydrant MCP Server User Guide

Tips and best practices to help you get up and running with FireHydrant's Model Context Protocol integration. Manage incidents, alerts, and retrospectives directly through AI assistants like Claude or Cursor. Welcome to the FireHydrant MCP Server user guide! This guide will help you get up and running with FireHydrant's Model Context Protocol integration, allowing you to manage incidents, alerts, and retrospectives directly through AI assistants like Claude or Cursor.

How Do I Customize My Service Hotline with SIGNL4's Call Routing?

Many organizations still rely on traditional phone hotlines to provide after-hours support or emergency coverage. While this approach is familiar, it’s often inefficient, hard to scale, and costly. Missed calls, voicemail black holes, or unclear routing logic can lead to delayed responses and frustrated customers. Whether you’re using a third-party service or your own PBX system, the process often requires manual steps, extra tools, or call forwarding rules that aren’t dynamic.

The Quest For The Five Minute Deploy

The Quest For The Five Minute Deploy Speed is everything at incident.io. The faster we can test and ship code, the faster we can get new products and features out to customers. Over the last three years, as our codebase grew and our test suite expanded, we drifted away from our own goals: "We aim for less than 5 minutes between merging a PR and getting it into production." This is the story of how we got back on track.

From Chaos to Control-How PagerDuty and AWS Are Protecting Business Continuity

The recent outage on June 12 proved yet again that service disruptions are inevitable, it’s not a matter of if, but when? And the next question is: how ready are you when that disruption strikes? What sets successful leaders apart is how quickly they are able to recover. Digital businesses are more complex than ever. Teams are managing sprawling cloud environments, microservices architectures, and a dizzying array of third-party integrations.

Being on-call at incident.io

At incident.io, we are building a product that our users rely on 24/7, all year round. This means it is crucial that it is always working, and that is where our on-call rotation comes in. We believe that everyone should be on-call because it tightens the feedback loop between shipping new features and maintaining what we have, leading to more pragmatic engineering decisions.

Learning MCP with PagerDuty

Join PagerDuty's Software Engineers José Côrte-Real and Manuel Reis, and host Daniel Afonso, Senior Developer Advocate, for a dive into Model Context Protocol (MCP) - we'll explore what it is, how it works, and showcase practical use cases in action. Plus, get an exclusive sneak peak at PagerDuty's upcoming open-source MCP server and learn how it can enhance your workflows.