Operations | Monitoring | ITSM | DevOps | Cloud

Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

5 Ways to Improve On-call Management (So Nothing Falls Through the Cracks)

Your enterprise has IT team members “on call,” so you can get immediate support with downtime, outages, and similar issues. That’s why streamlining on-call management may dictate your IT team’s success. Bonus Material: Advanced Escalation Example PDF To understand why, consider what will happen if a network or system crashes but IT team members cannot quickly and effectively communicate with one another.

New Uptrends integration with Opsgenie

You and your team have a lot of things begging for your attention. You’ve got multiple systems in place, and if anything goes wrong, the last thing you need is a storm of notifications coming at you from everywhere. To help you centralize your messaging and incident management, Uptrends continues to add integrations with tools that your team may already use. So, if you use Opsgenie, this new integration is for you.

Here are the Important Differences Between SLI, SLO, and SLA

When embarking on your SRE journey, it can seem daunting to decipher all the acronyms. What are SLOs versus SLAs? What’s the difference between SLIs and SLOs? In this blog post, we’ll cover what SLI, SLO, and SLA mean and how they contribute to your reliability goals.

How SLOs Enable Fast, Reliable Application Delivery

Application delivery is getting harder each day with the rise in complexity, the demand for services to be always-available, and the increasing pressure on teams to innovate. Service level objectives, or SLOs, can help. In this blog, we’ll discuss how SLOs are the key to modern application delivery, how to manage and measure them, the importance of observability for your SLO solution, and how to begin the journey to reliable application delivery today.

Extend the Power of Your Teams With PagerDuty's ServiceNow Integration Update

You asked and we’re delivering! We’re introducing several new and exciting features to PagerDuty’s ServiceNow integration that you, our customers, have requested. Our most anticipated new feature utilizes ServiceNow CMDB (Configuration Management Database) data to easily build service hierarchies in PagerDuty through business service dependencies.

What is a Kubernetes Operator and Why it Matters for SRE

Kubernetes is an open-source project that “containerizes” workloads and services and manages deployment and configurations. Released by Google in 2015, Kubernetes is now maintained by the Cloud Native Computing Foundation. Since its release, it has become a worldwide phenomenon. The majority of cloud native companies use it, SaaS vendors offer commercial prebuilt versions, and there’s even an annual convention!