Operations | Monitoring | ITSM | DevOps | Cloud

Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Why you can't have AIOps without Data Engineering

There’s a familiar saying: garbage in, garbage out. For ITOps, this directly applies to data engineering. BigPanda’s Area Vice President of Value and Adoption, Craig Ferrara, says the importance of data hygiene—putting good data in to get good data out—is the core of data engineering, and it requires ITOps to take a look at their data before integrating with an AIOps solution.

Sponsored Post

Reducing Security Incidents: Implementing Docker Image Security Scanner

Are you utilizing Docker to deploy your applications? If so, you're not alone. The use of Docker has skyrocketed in popularity in recent years. While it offers numerous benefits, it also introduces new security risks that need to be addressed. But, why is reducing security incidents so important? Simple - the cost of a security breach can be devastating. From lost customer trust to financial losses, the consequences of a security incident can be severe. That's why it's crucial to take steps to prevent them from occurring in the first place. Enter Docker image security scanners.

Incident Workflows

Time is of the essence when responding to incidents and within seconds all the right responders need to be mobilized and the right stakeholders informed. PagerDuty Incident Workflows empowers teams with sophisticated automation capabilities to reduce the manual work required to escalate and mobilize team members. Using if-this-then-that logic on our no-code/low-code builder you can orchestrate and automatically trigger the right set of incident actions for your needs at any time.

Taking the fear out of migrations

Over the last 18 months at incident.io, we’ve done a lot of migrations. Often, a new feature requires a change to our existing data model. For us to be successful, it’s important that we can seamlessly transition from the old world to the new as quickly as we can. There are few things in software where I’d advocate a ‘one true way,’ but the closest I come is probably migrations. There’s a playbook that we follow to give us the best odds of a smooth switchover.

S1E1: Maximize service uptime with efficient incident management workflows [Cloud]

In this episode of Masterclass 2023, we'll cover how IT service management teams can utilize ServiceDesk Plus Cloud to quickly handle incidents and streamline the incident resolution process in a hybrid work setting. You'll learn how ServiceDesk Plus Cloud can enhance the effectiveness of incident management practice through collaboration, dynamic template creation, automation, and more.

S1E1: Maximize service uptime with efficient incident management workflows - Masterclass 2023

In this episode of Masterclass 2023, we'll cover how IT service management teams can utilize ServiceDesk Plus to quickly handle incidents and streamline the incident resolution process in a hybrid work setting. You'll learn how ServiceDesk Plus can enhance the effectiveness of incident management practice through collaboration, dynamic template creation, automation, and more. Useful resources Follow us on social.

[PODCAST] Season 2 - Episode 1 The ITOps 2023 predictions; what does the future hold.

What will 2023 hold for ITOps? As we look back to 2022, its stellar growth for many companies and positive hiring trends, we hope that 2023 is even more successful for those involved in ITOps. In this episode, we take a deep dive into #predictions for 2023 and the future of #ITOps.#aiops #ITOps #podcast

[PODCAST] Season 2 - episode 3 - Resolving unforseen ITOps events

Even the best teams can encounter outages. Sometimes there's environmental anomalies in the data center or a component failure that leads to unplanned downtime. In this episode, we explore how IT teams can limit the impact of outages to business operations and resolve them when they arise.#itops #aiops #podcast

Game Day: Stress-testing our response systems and processes

At incident.io, we deal with small incidents all the time—we auto-create them from PagerDuty on every new error, so we get several of these a day. As a team, we’ve mastered tackling these small incidents since we practice responding to them so often. However, like most companies, we’re less familiar with larger and more severe incidents—like the kind that affect our whole product, or a part of our infrastructure such as our database, or event handling.