Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

The Debrief: Incident management for data teams

If you're on a data team, have you ever considered using an incident management tool to respond to pipeline issues? If the answer is no, then you might want to check out this episode. Here, we chat with Jack, Data Analyst at incident.io, to better understand why data teams can—and should—look to incident management tools like incident.io to manage issues. We chat about.

The Debrief: A year in review-2023 at incident.io

What a year 2023 was at incident.io! While it's hard to summarize 365 days, a few things stand out: So as we close the curtain on 2023, we sat down with the three co-founders of incident.io to do a bit of reflection on the wild ride that was this year. In this episode you'll hear them discuss challenges, big wins, moments of growth, what's next for us, and most importantly, what the three co-founders like most about one another.

How To View Previous Incidents To Gain Helpful Context During Incident Triage?

Picture this: you're knee-deep in resolving a P1/P0 incident, urgently seeking answers. What if you could tap into past incidents to get important incident insights and streamline your troubleshooting process? In this blog, we pitch into the practical aspects of leveraging Squadcast's Past Incidents feature to help you enhance your Incident Management process.

Setting the foundations for on-call that's fair, balanced, and human-focused

Whenever you're providing a service to businesses or individuals that they rely on, it's important to make sure that it's up and running as much as possible without disruptions. But the reality is that, despite your best efforts, downtime does happen. Regardless of when incidents strike, whether it’s 2 PM in the middle of the working day or 2 AM, it's important to have people available to diagnose and resolve issues as soon as possible.

What is a multi-cloud management platform?

As an IT leader, you’re acutely aware of the struggles of juggling multiple cloud environments, from integration headaches to holistic incident management to monitoring multiple clouds at once. Seeking a more efficient multi-cloud management solution is crucial to alleviate these pressures and streamline your cloud operations.

Episode 23: Zero-Downtime Updates with Todd Whitney

With limited error budgets and low user tolerance for maintenance window, the ability to execute routine updates without a maintenance window is an increasingly important socio-technical capability. Hear from Todd Whitney, who recently spoke at HashiConf about how PagerDuty performs updates while upholding its promise to customers of taking zero maintenance windows.

How MSPs and MSSPs can reduce risk and liability for their clients

For 83% of companies, a cyber incident is just a matter of time (IBM). And when it does happen, it will cost the organization millions, coming in at a global average of $4.35 million per breach. Add to that stringent data protection laws and the growing frequency and reach of ransomware and other sophisticated attacks.