Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

A Journey Through Blameless from Incident to Success

Here at Blameless, every aspect of our product has SLOs (Service Level Objectives) and error budgets to help us understand and improve customer experience. Sometimes these error budgets are at risk, which triggers an incident. While incidents are often painful, we treat them as unplanned investments and strive to learn as much as we can from them. We empower all of our engineers to handle an on-call rotation, no matter how difficult the issue.
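
As a rough illustration of what it means for an error budget to be "at risk", the arithmetic can be sketched in a few lines of Python. This is a generic sketch, not Blameless's implementation: the function names, the request-based SLO, and the 25% threshold are all assumptions made for the example.

```python
def error_budget_remaining(slo_target, total_requests, failed_requests):
    """Return the unspent fraction of the error budget (negative if overspent).

    slo_target is the fraction of requests that must succeed, e.g. 0.999.
    """
    # The budget is the number of failures the SLO tolerates in this window.
    allowed_failures = (1.0 - slo_target) * total_requests
    if allowed_failures == 0:
        # A 100% target (or an empty window) leaves no budget to spend.
        return 0.0 if failed_requests == 0 else float("-inf")
    return 1.0 - failed_requests / allowed_failures


def budget_at_risk(slo_target, total_requests, failed_requests, threshold=0.25):
    """Hypothetical policy: flag the budget once less than `threshold` remains."""
    remaining = error_budget_remaining(slo_target, total_requests, failed_requests)
    return remaining < threshold
```

With a 99.9% target over one million requests, the budget is about 1,000 failed requests, so 800 failures leave roughly 20% of the budget and would trip the assumed 25% threshold.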

Do More and Work Where You Are With Our New Integration for Jira Server and Data Center

Many of you may be reading this blog from home, a remote office somewhere, a family member’s house, or—if Zoom backgrounds are to be trusted—the cockpit of the Millennium Falcon. We’re all learning how to get better at “working where we are,” and that includes optimizing the tool stack you use each day.

Sticking to Your SLAs with FireHydrant Runbooks

In today’s world, systems are becoming more and more complex. Because of this complexity, it’s no longer a matter of “if” our systems will fail but “when”. To manage expectations for when our systems do fail, we need look no further than our Service Level Agreement (SLA).

Incident Response with Atlassian's Opsgenie

Learn all about Incident Response with Atlassian's Opsgenie. Respond to incidents from the Incident Command Center, identify potential root causes from the Incident Investigation view, and keep track of key information within the Incident Timeline. Once the incident is resolved, easily fill out the postmortem template and export it to Confluence.

Zenduty - Microsoft Dynamics Integration

Microsoft Dynamics is a line of enterprise resource planning and customer relationship management software applications. Microsoft markets Dynamics applications through a network of reselling partners who provide specialized services. Microsoft Dynamics forms part of "Microsoft Business Solutions". The Zenduty-Dynamics integration helps you escalate critical cases and incidents to the right team, proactively alert them about SLA violations, and bring SMEs and stakeholders into high-priority cases. To know more about the Integration,

PagerDuty Slack Integration How-To Video

Learn how to install, configure, and test the PagerDuty Slack Integration and work wherever you are. Many modern ITOps and DevOps teams count on Slack to keep everyone on the same page when things are running smoothly—and perhaps even more so when they aren’t. Slack users can do things like reassign or escalate an incident and view additional incident context—all from within Slack. The PagerDuty platform also allows users to create an incident war room Slack channel from within PagerDuty, adding additional users to it as the situation evolves.

SRE Leaders Panel: Work as Done vs Work as Imagined

Blameless recently had the privilege of hosting some fantastic leaders in the SRE and resilience community for a panel discussion. Our panelists discussed the effects of imposter syndrome, especially during high-tempo situations, how to use it to our advantage and overcome doubt, and how culture directly affects the availability of our systems. The transcript below has been lightly edited, and if you’re interested in watching the full panel, you can do so here.