Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Incident Response Automation: How It Works & Why It Speeds Up Resolutions

Nov 8, 2024 By Vishal Padghan In Squadcast

The speed at which you respond to incidents can make or break user satisfaction, team morale, and business continuity. Whether it’s a server crash, a security breach, or a software bug affecting users, rapid and efficient incident management is key to maintaining a strong reputation and minimizing operational downtime. And while traditional manual responses have worked in the past, automated incident response is now paving the way for faster, smarter, and more efficient handling of these issues.

Read Post

Squadcast

Read more about Incident Response Automation: How It Works & Why It Speeds Up Resolutions

The 9 Best PagerDuty Alternatives in 2024

Nov 8, 2024 By Aman In Zenduty

As tech grows more dynamic, SRE (Site Reliability Engineering) teams constantly seek smarter, more efficient tools to manage incidents and alerts. While PagerDuty has been a go-to solution, many teams are discovering the limitations of outdated legacy tools. With high costs, rigid integrations, and feature bloat, it’s understandable why so many are exploring PagerDuty alternatives that offer streamlined, budget-friendly, and innovative solutions for incident management.

Read Post

Zenduty

Read more about The 9 Best PagerDuty Alternatives in 2024

Demo Roundups! Automation Standardization (Workflows)

Nov 8, 2024 By PagerDuty In PagerDuty

Join PagerDuty’s Solutions Consultants Bobby Zimmerman and Justyn Roberts to discover how combining technical automation with human-driven processes can reduce manual interventions, streamline repetitive tasks, and increase operational efficiency. Level up your digital operations expertise with PagerDuty Demo Roundups — a series of live, interactive webinars where you can deepen your knowledge in the Operations Cloud and see how PagerDuty can work for you. Each 1-hour session presents a hands-on demo that showcases PagerDuty’s capabilities in real-time followed by Q&A.

View Video

PagerDuty

Incident Management

Read more about Demo Roundups! Automation Standardization (Workflows)

How we model our data warehouse

Nov 8, 2024 By Jack Colsey In Incident.io

We've written several times about our data stack here incident, but never about our underlying data warehouse and the design principles behind it. This blog post will run through the high-level structure of our data warehouse and then will go in-depth into the underlying layers.

Read Post

Incident.io

Read more about How we model our data warehouse

Stop, Drop, and SEV4: Why small incidents are a big deal with Derek Brown

Nov 7, 2024 By Incident.io In Incident.io

Watch Derek's full talk from SEV0 here: https://go.incident.io/a8xPaeB

View Video

Incident.io

Incident Management

Read more about Stop, Drop, and SEV4: Why small incidents are a big deal with Derek Brown

Site Reliability Engineer's Guide to Black Friday

Nov 7, 2024 By Zoe Collins In OnPage

It’s gotten to the point where Black Friday reliability prep has to start on…well Black Friday. This year, 32% of consumers in the US claimed that they were going to start their holiday shopping in July-October. Plus, Black Friday isn’t the only day eCommerce businesses have to worry about, now we have Cyber Monday, Travel Tuesday, and the thousands of Prime Days from Amazon.

Read Post

OnPage

Read more about Site Reliability Engineer's Guide to Black Friday

Runbook Automation and Rundeck v5.7 Release Notes

Nov 7, 2024 By PagerDuty In PagerDuty

Product Managers Jake and Forrest join us for a spooky stream to talk about the Runbook Automation and Rundeck release v5.7. Project Runner Management is now generally available.

View Video

PagerDuty

Read more about Runbook Automation and Rundeck v5.7 Release Notes

Engineering an AI Proxy for ilert

Nov 7, 2024 By Daria Yankevich In iLert

Building an AI proxy for our AI features was one of the best decisions we made a year ago. In this article, we will share why and what challenges we faced.

Read Post

iLert

Read more about Engineering an AI Proxy for ilert

Lessons from 4 years of weekly changelogs

Nov 7, 2024 By Pete Hamilton In Incident.io

Writing a meaningful update for customers every week has been held sacred at incident.io since we started the company. We've written over 200 of them in the past 4 years, and we recently celebrated going 2 years straight without missing a single a single week The numbers themselves are not the goal, but the consistency of this habit and what it represents for our customers and our team is very real, and special to me.

Read Post

Incident.io

Read more about Lessons from 4 years of weekly changelogs

Operationalizing AI for IT operations

Nov 6, 2024 By Conor Castronovo In BigPanda

Advances in artificial intelligence are rapidly transforming the IT operations landscape. According to Enterprise Strategy Group, 85% of organizations use or plan to deploy AI across many functional areas, including IT operations. Among its many benefits, AI can help ITOps teams: AI has immense potential to transform how IT operations, service management, and infrastructure teams function. Adoption is the first step toward creating organizational change.

Read Post