Operations | Monitoring | ITSM | DevOps | Cloud

Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Why AIOps is Worth the Investment During an Economic Downturn

Recent talks of an economic softening have left IT leaders concerned about the future of their enterprises. That concern is understandable — tech layoffs create near-daily headlines at this point, with top companies rolling back their operations and rolling up their sleeves to focus on mission-critical expenses. And for many in ITOps, that means cutting tools.

Incident Workflows with Sam Ferguson

PagerDuty’s new Incident Workflows feature will help your teams build powerful, flexible incident response processes customized to your organization’s needs. Add Slack channels, Zoom calls, responding teams, and more. PagerDuty Senior Product Manager Sam Ferguson walks us through how this new featureset works and demonstrates some of the capabilities.

ServiceNow Integration - xMatters Integrations

Looking to extend the value of your existing applications? The xMatters and ServiceNow integration allows organizations to accelerate IT incident response, reduce downtime, and maximize service reliability. Learn some of the most popular ways you can utilize these two industry-leading platforms, including engaging resources and automated technical escalations!

Reporting Incident Using Webforms I Creating Alerts from Outside the Squadcast Ecosystem I Squadcast

Webforms can help stakeholders & the customers of an organization easily report issues. This video explains how users from outside the Squadcast ecosystem can report incidents by filling out a simple form and extend customer support by empowering internal stakeholders and customers to report issues on the go.

How To Setup Outgoing Webhooks in Squadcast | Recieving Incident Information | Squadcast

Webhooks allow you to connect a platform you manage (either an API you create by yourself or a third-party service) to a stream of future events. Setting up a Webhook on Squadcast enables you to receive information (referred to as events) from Squadcast as they happen. This can help you avoid continuously polling Squadcast’s REST APIs or manually checking the Squadcast web/mobile application for desired information.

How to Set up SLOs and Configure SLIs in Squadcast | Tracking Error Budget & Burn Rates | Squadcast

This video will help you define and monitor Service Level Objects for your services and also set up and track error budget burn rates in Squadcast. A Service Level Objective (SLO) is a reliability target, measured by a Service Level Indicator (SLI), and sometimes serves as a safeguard for a Service Level Agreement (SLA). SLOs represent customer happiness and guide the development team’s velocity.

Quick! Grab all the evidence: Capturing application state for post-incident forensics.

Everyone loves a good mystery thriller. Ok, not everyone – but Hollywood certainly does. Whether it’s Sherlock Holmes or Hercule Poirot, audiences clearly enjoy a page-turning plot of hunting down the culprit for some heinous crime.

5 Best practices for developing a culture of continuous improvement

How do you create a great engineering team? Exclusively hire brilliant, tenured computer science PhDs. There we solved it. You can skip the next 400 words. (I can hear my college professor in my head saying “Humor might not be your strong suit”) Building a great engineering team isn’t easy. Understatement of the year. It’s not even a problem to be solved per se. We need to think about it as preparation to solve an infinite set of constantly evolving problems.