Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

2025 Starts Here: PagerDuty Innovations to Help You Tackle What's Next

As we enter 2025, we reflect on a year committed to innovation and customer success at PagerDuty. In 2024, we introduced capabilities that empowered operations teams to mitigate risks, protect customer trust, and improve business outcomes. From managing global outages to addressing complex digital operations, the PagerDuty Operations Cloud enabled organizations to respond faster, work smarter, and build operational resilience.

Enrich your on-call experience with observability data at your fingertips by using Datadog On-Call

The stress, sudden disruptions, and high stakes of resolving issues while on call is one of the most challenging aspects of an engineer’s job. Many organizations, from startups to large enterprises, still struggle with their on-call experience, which leads to longer resolution times and lower employee retention rates. Constant context switching, managing multiple tools, and racing against time to resolve issues can cause frustration, burnout, and inefficiency.

The Impact of Artificial Intelligence on Modern Software Development

Artificial intelligence (AI) is reshaping industries, and software development is no exception. By integrating AI technologies like machine learning, generative AI, and natural language processing, development teams can optimize workflows, enhance code quality, and reduce time-to-market. In this article, we’ll examine AI in software development, including its benefits, challenges, and most recent developments. Let’s get started.

Notify clients about incidents using AI

During the heat of incident response, staying focused on resolving the issue quickly is essential. Crafting clear and accurate incident updates, however, can be challenging under pressure. That’s where ilert’s AI-powered incident communication feature makes all the difference. This feature is a part of the ilert AIOps add-on.

xMatters Yars' Revenge Release

If you’re not an expert in destroying energy shields, dodging enemy swirls, or using space cannons to avenge your home planet like players in Yars’ Revenge, don’t worry! Our latest release is here to help you focus on fighting incidents that are a little more down to earth! Let’s take a look at some of the new features you’ll find in your incident-fighting arsenal.

How data habits help build a data culture

It's no secret that building a data-driven culture in a company is hard, but what is it exactly that makes this such a tricky endeavor? Contrary to popular belief, technology isn't the main hurdle. A recent survey reveals that only a quarter of respondents cite technological limitations as the primary obstacle to becoming data-driven.

What is Alerting?

What is Alerting? Alerting is a central component of modern safety and operating concepts. It is used to act quickly and effectively in hazardous situations. From operational alerting in operations management to alerting the population, there are various scenarios that cover specific requirements and areas of application. In this article, we provide an overview of the various alerting methods and their significance.

The three pillars of observability

Do you feel you’re always playing catch-up with incidents? If so, you’re not alone. As IT environments become more complex, alerts keep piling up, and finding the root cause feels like searching for a needle in a haystack. And ITOps and incident responders are left scratching their heads and wondering: what went wrong? It can be frustrating when you don’t have end-to-end visibility into your systems. This is where observability comes in.

Kickstart your investigations and reduce alert noise with Doctor Droid's offering in the Datadog Marketplace

Being an on-call engineer is often overwhelming, requiring you to pivot between tickets, dashboards, runbooks, and different data sources as you try to separate legitimate incidents from unnecessary noise. Not only does the process of investigating irrelevant alerts take time away from remediating important issues, but it also compounds alert fatigue.