Operations | Monitoring | ITSM | DevOps | Cloud

Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Enterprise Alert's Automation Engine: Creating BMC Incidents

Recently we have received a lot of requests for Enterprise Alert to not only alert on critical situations but to also take a proactive approach to initiate, record and track those situations through ITSM tools such as ServiceNow and BMC Remedy. This post will center around what happens when critical systems fail and tickets are not being created in BMC due to a break in the workflow.

Investigating the Scene of an Incident: Using a Time-Traveling Topology to Create Escalation Graphs

Yes, time travel is possible...through data. My ability to time travel began when I started coding at age 10. Back then, all of my code ran on my own little computer. Like many ten-year-olds, I coded to create and play games. I also coded cool graphics to accompany music to impress my friends and utilities for copying. I launched my first commercial website in 1996 and made 25 guilders, which was good money for a 15-year old. Life was so easy.

Chapter Eight: In Which James Embarks on a Service Desk Migration to Improve Incident Management with AIOps

It’s been a month since Dinesh and I humbly high-fived leaving the meeting with Charlie and Lucia and they gave us the green light to roll Moogsoft out across the whole of C&Js and I’m feeling a little weary. Change is hard. I’ve also made it harder on myself by persuading Charlie we should also migrate our service desk solution.

Elephant in the Blameless War Room: Accountability

We’ve always advocated that every company can benefit from a blameless culture . Fostering a blameless culture can profoundly boost your organization in powerful ways, from employee retention to developer velocity and innovation. However, there’s an elephant in the room when we talk about blamelessness with executives: accountability. When things go wrong, people still need to get fired, right?

Threat Stack and Squadcast Integration Streamlines Alerts with Greater Context

This is a guest post collaboration between Squadcast & Threat Stack. The move to the cloud has rapidly expanded the cyber threat surface of modern cloud apps. This blog in partnership with Threat Stack, outlines how you can stay on top of your game with help of context-rich alerting & resolve security incidents rapidly along with few best practices to follow for faster incident response.

Wiley Relies on PagerDuty as the World Moves Towards Digital Learning

John Wiley & Sons, Inc., commonly referred to as Wiley, is a global publishing company founded in 1807 that focuses on academic publishing and instructional materials. Sean Mack, CIO and CISO of Wiley, discusses how PagerDuty is empowering teams to own and support services 24/7/365 as digital learning becomes more prevalent.

xMatters Lunar Lander Release - New Product Features - xMatters Demo

xMatters Lunar Lander release is here! Join Sr. Director of Customer Success, Kerin Munro, and Product Manager, Daniel Reich as they discuss some of the latest and greatest product features that went live with the Lunar Lander release. These updates include new possibilities in xMatters Flow Designer with a create alert step and an incident severity step, updates to Event Flood Control, and more!

3 Steps For A More Strategic Approach to Incident Reduction

When an IT incident negatively impacts employee experience, IT teams rush to remedy the issue – understandably, as a widespread incident can have major effects on employees’ productivity, security, and overall experience. Yet, so many IT teams find themselves drowning in support tickets even as they continue to resolve top call drivers (the incidents that affect the most employees and drive the most support requests).