Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

A Launch Day in the Life with AI Teammates

Alex, an SRE at Greenagonia, starts the day knowing there’s a big launch coming. Pre-orders suggest a 5-10x increase in normal traffic, which means coffee needs to be extra strong this morning. As Alex scans through overnight alerts, he realizes he’s completely forgotten about a dentist appointment that overlaps with his upcoming on-call shift. Six months ago, this would have meant frantic Slack messages or at least one phone call. Today? Alex’s AI teammate has it covered.

7 Ways Your Incident Management Just Got a Boost (New Feature Rundown)

All the things you may have missed that will make your incident management smarter, faster, and simply easier. We ship updates every week because we want you to get the most out of FireHydrant. But we also know it's hard to stay up to date and read every week's changelog (even though we know reading changelogs is the highlight of your week ).

Experimenting With Different Scripts

It all began when I spun up an AWS t4g.small burstable instance for a side project. Nothing unusual just another day in the cloud. But the moment I connected through SSH, something caught my eye. The system greeted me with a temperature reading of -273.5°C. Wait… what? That’s 0 Kelvin, the point where atomic motion completely stops. In other words, absolute zero , a state that’s theoretically impossible for anything to operate in.

Agentic ITOps: The evolution of AIOps

Enterprise IT departments are struggling to keep up with the dramatic increases in complexity, fragmentation, and chaos in their IT environments. Legacy tools and processes designed for monolithic systems and static infrastructures cannot meet these challenges. Enterprise ITOps requires a more agile and intelligent approach that leverages advances in AI and automation to remain scalable, effective, and sustainable.

The 2025 Guide to Open Source Status Page Software

This is an updated version of the 2024 article. Maintaining transparent communication about service availability is crucial for businesses of all sizes. Status pages are an important part of your communication strategy during times of outages and maintenance events. You can choose to go with a fully managed status page provider or host an open-source one yourself.

Introducing the ilert × Livewatch native integration

We’re excited to announce that ilert now offers a native integration with Livewatch, unlocking seamless incident escalation from monitoring to response. Starting today, all alerts generated by Livewatch can be automatically ingested, grouped, escalated, and managed from within ilert – closing the loop between detection and resolution.

Demo - WhatsApp notifications

Demo – WhatsApp notifications: When generally available, the integration with WhatsApp will allow your PagerDuty notifications to be delivered instantly and reliably via WhatsApp. Security is built in via WhatsApp’s end-to-end encryption per their terms and conditions, so your critical information stays private and protected. With this user-friendly experience, you’ll receive alerts with clear formatting, actionable buttons, and key the context, right inside WhatsApp.