
Incident Management

The latest news and information on incident management, on-call, incident response, and related technologies.

Now you see me, now you don't: feature-flagging with LaunchDarkly at incident.io

At incident.io, we ship fast. We're talking multiple times a day, every day (yes, including Fridays). Once I merge a pull request (PR), my changes rocket their way into production without me lifting a finger. 💅 It's when we tackle larger projects that this becomes a bit more complicated. We recently launched Announcement Rules, which let you configure which channels incident announcements are posted in depending on criteria you define.

Your Ops and DevOps teams need to work together, and fast. Who you gonna call?

The world is moving fast, led by an ever-accelerating IT landscape. In recent years, two distinct types of teams have emerged that assist in driving this business transformation: DevOps/SRE teams that are in charge of driving rapid innovation of products and services, and IT Ops/NOC teams that focus on preventing outages and maintaining the high level of quality, reliability and serviceability that modern, discerning customers expect.

How Playbooks improve customer service delivery, agent productivity

We all know that one bad experience can affect a customer’s perception of an organization, and even their willingness to deal with it, going forward. That’s why so many companies, in virtually every industry, have made investing in customer experience (CX) a top priority, according to ResearchAndMarkets.com. The problem is that any given organization has a number of customer service processes across the entire lifespan of an interaction, and each one needs to be examined and improved.

New Apps for PagerDuty's Datadog Integration

Status Dashboard by PagerDuty and Incidents by PagerDuty are new apps available now in Datadog. See a live, shared view of system health to improve awareness of operational issues with Status Dashboard by PagerDuty. Acknowledge, troubleshoot, and resolve incidents with PagerDuty actions embedded directly in the Datadog interface to limit context switching among tools. Julia Nasser and Hadijah Creary join the stream to show off this powerful enhanced integration.

Make sense of complex systems with Dynamic Service Graph by PagerDuty

The Dynamic Service Graph breaks down silos between teams and provides organizations with a living, breathing asset that displays technical and business services and their relationships at scale. It allows teams to quickly grasp the state of services, visually digest the full impact radius of an issue, zero in on the likely cause, and seamlessly facilitate cross-team collaboration.

Leaning on Technology in The New Noisy: Managing Cloud, Change and Risk

Your company’s “digital transformation” will be driven by new application designs and methods, new technology stacks, and new processes. To master it and deliver next-generation services through it, you need to leverage, process, and act on massively complex sets of signals and data. Developers need integrated data and insights that cut through that noise, while still being able to use their tools of choice. All of this must be managed despite massive rates of change and innovation.