Operations | Monitoring | ITSM | DevOps | Cloud

Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

How the technology you choose influences CloudOps maturity

As the world becomes increasingly digital-first, it’s more important than ever for organizations to keep services always-on, innovate quickly, and deliver great customer experiences. Uptime is money, so it’s no surprise that many have made the shift to cloud in recent years in order to make use of its flexibility and scale—while controlling costs. And while 2020 wasn’t easy for any organization, those that are thriving have embraced the digital mindset.

How Squadcast Benefits On-call Engineers - Part 1

It is difficult to stay completely reliable in an always-on world. So it's very important to choose the right Incident Management solution that can solve your problems. In this blog, we have highlighted the benefits of Squadcast and why you should adopt it. “Being on-call sucks!" Often incident response teams use this phrase when talking about their on-call experiences. Despite using best practices for managing infrastructure, incidents do occur from time to time.

Dynatrace and xMatters Make Seamless Efficiency Possible - xMatters Demo

How can organizations integrate their tools into a platform that maximizes uptime and simplifies operations? Is it possible for the tools you already rely on to be more efficient? With Dynatrace and xMatters in tandem, the answer is yes! Join Rob Jahn, Technical Partner Manager at Dynatrace, Eric Maxwell, Solution Architect at xMatters, and Rutuja Rajwade, Partner Marketing Manager at xMatters, as they discuss how Dynatrace and xMatters can work together to make incident management and development processes more efficient.

DevOps & SRE Words Matter: How Our Language has Evolved

As the tech world changes, language changes with it. New technologies will always introduce new terms and descriptions to provide clear understanding. For example, the emergence of the cloud introduced language to describe the changing relationship between servers and clients. Then, of course, product providers will also dictate how their products are to be described, i.e. describing services as “cloud-native”.

WIRES and xMatters: Efficient Collaboration On a National Scale

An update on how xMatters service reliability platform is improving animal rescue response times through WIRES in Australia. We are extremely grateful for xMatters support and are excited to share this update with the xMatters community. We have made so much progress with our wildlife rescue response systems since the devastating bushfires of 2019 and 2020, despite the continuing challenges of COVID-19.

Managed Service Provider - How AlertOps Helps MSP Scale Digital Transformation Initiatives.

In an era where speed, productivity, and user experiences matter most what are the incident management capabilities managed service provider need most to grow, transform and mature their digital operations, processes and serve more organizations, faster and more efficiently. Many of today’s enterprises still have operations that are largely manual, reactive and lack the in-house resources and expertise to undertake a digital transformation initiative.

What's New: Introducing Delay Notifications to Control Alert Fatigue

The OnPage team is pleased to announce a new feature to the enterprise web console: Delay Notifications. With this new addition, organizations have the option to queue messages for specific time periods, delivering messages at the end of the Delay Notification schedule. The latest feature is designed to alleviate alert fatigue and improve work-life balance for incident respondents.

The Top 4 Key Levers to Build Towards Long-Lasting Digital Operations Maturity

Digital operations maturity is a journey. The first step is to understand where you are, where you want to get to, and what’s keeping you from getting there. Only then can you make strategic decisions and lay out a plan for how to approach any hurdles and land where you want your organization to be. For many organizations, upleveling operational maturity requires investment in driving cultural change with fundamental shifts to operating models.

Full-cycle observability with the Elastic Stack and Lightrun

An application running in production is a difficult beast to tame. Most experienced developers–ones who spent enough late nights or Saturday mornings trying to break apart a nasty production bug–will try and create the clearest possible picture for their later selves while writing their code, so that they could understand what’s actually going on in the system during an incident.