Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

July 2022 update - Remote actions for super-fast incident remediation

Our July update ships a very powerful new feature – remote actions. Remote actions are available for execution – once configured – in the SIGNL4 mobile app and allow you to quickly perform remediation actions without having to fire up a notebook and VPN or without using a desktop PC. So, genuine anywhere remediation comes true. As always, you can find all the details in this blog article.

What I learned from leading my first incident

A few weeks ago we had a major incident. We were releasing our Practical Guide to Incident Management, and after posting about it online an incident.io employee noticed that the page wasn’t loading. Just to set the scene, I’ve been at incident.io for 3 months and don’t have any experience of incidents in my previous role. When the team got paged I expected this to be one of those “follow along and learn how the wizards work their magic” exercises.

AlertOps is in the ConnectWise's 2022 PitchIT Accelerator Program!

PitchIT is a competition for MSP innovators. The program is designed to showcase potential offerings that could be built or integrated into the ConnectWise platforms. It’s a 16-week accelerator program where AlertOps and the other participants will go through a rigorous business assessment, gain coaching from industry experts, earn placement on the ConnectWise marketplace, engage in co-marketing and more.
Sponsored Post

Top Five Pitfalls of On-Call Scheduling

On-call schedules ensure that there's someone available day and night to fix or escalate any issues that arise. Using an on-call schedule helps keep things running smoothly. These on-call workers can be anyone from nurses and doctors required to respond to emergencies to IT and software engineering staff who need to fix service outages or significant bugs. Being on-call can be challenging and stressful. But with the proper practices in place, on-call schedules can fit well into an employee's work-life balance while still meeting the organization's needs.

Why More Incidents Are Better

Ask most SREs how many incidents they’d have to respond to in a perfect world, and their answer would probably be “zero.” After all, making software and infrastructure so reliable that incidents never occur is the dream that SREs are theoretically chasing. Reducing actual incidents by as much as possible is a noble goal. However, it’s important to recognize that incidents aren’t an SRE’s number one enemy.

Why Operational Maturity Helps Businesses Reduce the Great Resignation Trend

The past few years have led to fundamental business and cultural shifts for both companies and employees. Covid-19 has brought opportunities for companies who invested early in digital operations, while others struggled to maintain the status quo. The latter gave rise to record employee burnout, and what is now commonly referred to as the Great Resignation.