
Incident Management

The latest news and information on incident management, on-call, incident response, and related technologies.

Have You Herd? Episode 2 | Getting into the DevOps Culture

Join the Moogsoft Engineering team for our second episode of Have You Herd?! In this episode, we talk about how you can get into the DevOps culture, covering questions like: How do you contribute to a DevOps culture as an individual contributor? What pipelines and tools should a company have set up before embarking on the DevOps journey? What skills should you have to market yourself as a leading DevOps engineer?

How Integrations Lead to Easier, Quicker and Better Decision-Making

Whether from a monitoring tool such as Datadog, a collaboration tool such as Slack, an automation tool such as Chef, or a ticketing tool such as ServiceNow or JIRA, AIOps seamlessly integrates data from all of your IT sources. A robust AIOps solution with integrations can help your DevOps and SRE teams know where to begin fixing problems, so they can resolve incidents before they affect services and reduce downtime.

Why You Need Real-Time for Faster MTTR

“If you ain't first, you're last.” While that famous one-liner from Ricky Bobby (Will Ferrell) in the cult hit Talladega Nights is more joke than catchphrase, it hits home for those of us in the world of DevOps and Observability. Faster is better. And in our technology-driven world of online transactions and complex environments, faster isn’t just better — it’s crucial.

Solarisbank Banks on PagerDuty to Keep Financial Services Online

Solarisbank is Europe’s leading Banking-as-a-Service platform that enables any business to offer their own financial services. Satyajit Ranjeev, Daria Kameneva, and Jens Hermann discuss how PagerDuty helps teams implement a “you build it, you own it” model and reduce incident response times.

Can Emails Initiate xMatters Workflows? - Ask Adam

You’ve spotted an incident, but how do you get your team to start working on it? xMatters workflow expert Adam can show you how. Email triggers in xMatters are a fast, effective way to get workflows going with minimal fuss. There are a few steps to getting them configured right, so let's go through it from the beginning.

How to Introduce Automation to Incident Response with Slack and PagerDuty

Major-incident war rooms are synonymous with stress. Pressure from executives, digging for a needle in a haystack, too much noise: it all weighs on your hardworking technical teams. Incident responders clearly need a more effective way to collaborate across technical teams, one that both minimizes interruptions and keeps stakeholders up to date while ensuring everyone has the right level of context to do their job.

Leverage Observability With OpenTelemetry to Understand Root Cause Quickly

An observability solution should help any incident responder understand what changed and why. A lot has been written on the difference between monitoring and observability, but an easy way to understand how both are integral to incident response is to consider how customers use PagerDuty—with both monitoring and observability tools—to get to the right answer.