The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Inside Prezi's cost-saving switch to Grafana Alerting, Grafana OnCall, and Grafana Incident from PagerDuty

Alexander is a Senior SRE at Prezi, a video and visual communications software company. As a team, the Prezi SREs provide multiple services within the company. One of those is the observability stack, where Prezi relies heavily on Grafana. Companies are always evolving to run more smoothly, serve their customers better, and operate in a way that is cost-effective.

The connection between incident management and problem management

Sometimes, two concepts overlap so much that it’s hard to view them in isolation. Today, incident management and problem management fit this description to a tee. This wasn’t always the case. For a long time, these two ITIL concepts were seen as distinct—with specialized roles overseeing each. Incident management existed in one corner and problem management in the other. Then came the DevOps movement and the lines suddenly became blurred. So where do they stand today?

Streamlining Incident Management with our latest feature update: Merge Incidents

Hey folks! We're back with another nifty feature for your Incident Management tool arsenal. You now have the ability to merge incidents with a few clicks! With this latest update, you can reduce the noise while dealing with a complex incident by merging incidents across services under a parent incident. This typically applies when multiple incidents stem from the same underlying issue or root cause.

10 Benefits of Effective Incident Communication

In today's digital landscape, most people understand that no system is perfect and data is never 100% safe. Incidents are bound to happen. How people learn about those incidents often influences their reactions. Mishandled incident communication can have drastic consequences for your company. For starters, it can drag out the incident response and harm your bottom line.

Seven Models of Cloud Native Applications

In today's cloud-driven landscape, organizations are transitioning from legacy monolithic systems to agile, scalable, and secure cloud-native solutions. Some are even forging new cloud-native applications. However, the concept of cloud-native design remains subjective, lacking a universal blueprint. This blog aims to provide clarity and guidance for designing precise cloud-native applications and container deployment.

More than downtime: the cultural drain caused by poor incident management

The costs of lackluster incident management are truly far-reaching. We’ve learned they go beyond explicit costs, like lost revenue and labor expenses, and beyond the opportunity cost of engineers being diverted from building revenue-generating features. The final area of incident cost that’s often overlooked is cultural drain.

OnPage's Automation in I&O Optimization Predictions (Inspired by Gartner Hype Cycle for I&O Automation, 2023)

The release of the Gartner® Hype Cycle™ for I&O Automation, 2023 has inspired us here at OnPage to provide our insights on the latest trends in I&O optimization. In this blog, OnPage will predict the widespread adoption of technologies that can further automation efforts and thus contribute to I&O optimization.

Sponsored Post

The Future of ITSM: Exploring the Potential of AI-Powered Service Management

IT Service Management (ITSM) constantly evolves, introducing new technologies and tools. Recently, though, there have been some constants, and one of the most promising developments is leveraging Artificial Intelligence (AI) to power IT service management. That AI has the potential to revolutionize ITSM is not exactly breaking news. What continues to slip under the radar of many ITOps teams is how to unlock AI's true potential. Doing so requires understanding the already critical and soon-to-be popular use cases.