%term

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

A vital alerting solution

Nov 23, 2021 By Matt In SIGNL4

This article should give you a first idea of what SIGNL4 does. What do IT security, production monitoring and technical field service have in common? In all these scenarios, the right people need to get notified immediately – in case of technical malfunctions, urgent maintenance orders or emergencies, all in order to solve any incident quickly and efficiently.

Read Post

SIGNL4

Read more about A vital alerting solution

How We Deploy Product Releases at xMatters

Nov 22, 2021 By Doug Peete In xMatters

With Halloween behind us and the holiday shopping season fast approaching, engineering and product teams know what that means: code freezes! At xMatters, code freezes are a part of our product release process in anticipation of the busiest — and most important — time of the year for many of our customers. But code freezes are just one piece of the puzzle in how we ensure our customers have the most reliable experiences. The way our product releases are designed is much more than that.

Read Post

xMatters

Read more about How We Deploy Product Releases at xMatters

Your xMatters Schedule on Android Devices - xMatters Support

Nov 22, 2021 By xMatters In xMatters

Join Chris Patch, xMatters’ Senior eLearning Specialist, as he teaches you how to view and modify your schedule in the xMatters app on Android devices.

View Video

xMatters

Read more about Your xMatters Schedule on Android Devices - xMatters Support

Partner Integration - Dynatrace with PagerDuty and Rundeck

Nov 22, 2021 By PagerDuty In PagerDuty

Deliver perfect software experiences with real-time intelligence into customer satisfaction and behavior, your applications, and the performance of your hybrid multi-cloud. AI-powered root-cause analysis automatically identifies customer facing performance issues and pinpoints the root-cause within seconds. Open APIs allow ingestion of 3rd party metrics and enable complex system integrations. In this demo, Rob Jahn shares a sophisticated incident remediation workflow incorporating intelligence from Dynatrace, automation in Rundeck, and incidents in PagerDuty.

View Video

PagerDuty

Read more about Partner Integration - Dynatrace with PagerDuty and Rundeck

Building safe-by-default tools in our Go web application

Nov 22, 2021 By Lisa Karlin Curtis In Incident.io

At incident.io, we're acutely aware that we handle incredibly sensitive data on behalf of our customers. Moving fast and breaking things is all well and good, but keeping our customer data safe isn't something we can compromise on. We run incident.io as a multi-tenant application, which means we have a single database (and a single application).

Read Post

Incident.io

Read more about Building safe-by-default tools in our Go web application

4 Ways To Ensure Reliability of Your Digital Services for GivingTuesday

Nov 22, 2021 By Jesse Maddex In PagerDuty

In today’s digital economy, seconds matter. For mission-driven organizations, seconds can be a matter of life and death, and service reliability can make or break access to suicide and safety hotlines, disaster relief, time-critical health care, food assistance, and more. That’s where real-time digital operations comes in.

Read Post

PagerDuty

Read more about 4 Ways To Ensure Reliability of Your Digital Services for GivingTuesday

History of SRE: Why Google Invented the SRE Role

Nov 19, 2021 By JJ Tang In Rootly

A history of Site Reliability Engineering from its origins at Google in 2003 to the present.

Read Post

Rootly

Read more about History of SRE: Why Google Invented the SRE Role

Your xMatters Schedule on iOS Devices - xMatters Support

Nov 19, 2021 By xMatters In xMatters

Join Chris Patch, xMatters’ Senior eLearning Specialist, as he teaches you how to view and modify your schedule in the xMatters app on iOS devices.

View Video

xMatters

Read more about Your xMatters Schedule on iOS Devices - xMatters Support

Training Intelligent Alert Grouping

Nov 18, 2021 By Quintessence Anx In PagerDuty

Complex incidents are both exhausting and commonplace. In this case, incidents that I am referring to as “complex” are incidents that involve multiple, disparate, notifications in your alert management platform. Perhaps these incidents are logically separated because the underlying systems or services were seen as less coupled than they turned out to be in reality.

Read Post

PagerDuty

Read more about Training Intelligent Alert Grouping

Using Predictive Analytics Capability to Resolve Critical Incidents

Nov 18, 2021 By Srinivas Miriyala In Fabrix

CloudFabrix solution provides a holistic approach for enterprises to implement proactive operations with the objective of eliminating/reducing critical incidents and improving customer satisfaction. The solution primarily relies on applying regression/forecasting models on any time-series data to detect and forecast anomalies. One of the unique features of the solution is the ability to convert unstructured data such as logs/incidents/alerts into time-series data to be used for running prediction models.

Read Post