Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Outage Alert: Top 10 Downtime Incidents of 2021

2021 has been an eye-opening year for both businesses and consumers who use popular websites and applications. We have all seen notable increases in the frequency and severity of outages as dependency on internet infrastructure grows – with no signs of slowing down. With our reliance on automation and connectivity expected to increase in 2022 – let’s review some of the top internet outages and website downtime incidents of 2021.

What to Expect From xMatters in 2022

With only a few days left of 2021, we all know what that means: making New Year’s resolutions. While some love the tradition of laying out their goals for the coming 12 months, others loathe it with a passion. And with approximately 80% of people failing to achieve their resolutions, it’s easy to see why there’s so much resentment towards this common habit. At xMatters, we plan to—and often do—beat those odds.

On-Call Escalations

With the AlertOps ServiceNow integration, you can use automatic escalations for on-call schedules and create custom escalations. Automatically escalate to a level 2 or level 3 team and notify management and stakeholders. Set each escalation to use the notification channel you choose (email, voice, SMS, mobile app, and chat). Set your escalations to trigger reminders when a response SLA or a resolution SLA has been breached or is approaching the deadline.

Tips & Tricks: Keeping Track of Event-Processing Delays

A couple of weeks ago our partner Rok Ponikvar from S&T contacted me about an issue one of his customers faced. His customer complained that Enterprise Alert is not alerting on current issues and even if he creates a test ticket in his OBM system no alert goes out. After a little back and forth we concluded that Enterprise Alert is still processing historic data from an Event Storm in OBM earlier that day.

Common Security related Questions and Answers

In light of the recent news about yet another reported Zero-Day Exploit and the accompanying discussions about security, let’s touch on the topic of security audits and how Enterprise Alert can be configured to avoid or at least minimize potential security impact. First, let’s establish what we mean by security audit.

How to Measure Uptime SLOs Using Pingdom and Nobl9

Do you find yourself asking, “What should our first service-level objective (SLO)be?” The simplest way to get started if you have a website is to measure uptime SLOs. The SLO will measure your uptime and how your site compares to your reliability goals. By following the steps outlined here, you can get up and running with your first SLO in minutes. To get started, you’ll need to set up an account on SolarWinds® Pingdom®.