Latest News

Development Pipeline: What should you consider?

Mar 29, 2023 By Aman In Zenduty

As software development continues to evolve and become more complex, the need for efficient and effective deployment strategies has become increasingly important. This is where deployment pipelines come in. When it comes to software development, a deployment pipeline is a powerful automated tool that facilitates the fast and accurate transition of new code changes and updates from version control to the production environment.

Read Post

Zenduty

Read more about Development Pipeline: What should you consider?

Mean Time to Acknowledge (MTTA): What It Means & How To Improve MTTA

Mar 29, 2023 By Muhammad Raza In Splunk

The sooner you know about a problem, the sooner you can address it, right? Imagine if you could do that in your most important apps and software. Well, that’s exactly what MTTA measures. Let’s take a look.

Read Post

Splunk

Read more about Mean Time to Acknowledge (MTTA): What It Means & How To Improve MTTA

How to reduce mean time to act by tracing alerts with AIOps

Mar 29, 2023 By Jason Walker In BigPanda

This is the story of an insurance company that was getting six million IT alerts every 90 days and how they used BigPanda’s AIOps to reduce it to less than 50,000. Before we get into that though, let’s take a step back. How did we, as an IT sector, get to a place where organizations receive 6,000,000 IT alerts in the first place?

Read Post

BigPanda

Read more about How to reduce mean time to act by tracing alerts with AIOps

Announcing our improved Slack integration

Mar 28, 2023 By Vishal Padghan In Squadcast

Slack is one of the most widely used messaging Apps, providing collaboration and chat solutions to businesses. We at Squadcast understand that most of your work happens over Slack. Hence, we have made improvements to our Slack integration capabilities by introducing a bunch of UI and functional improvements. This blog will give you an overview of the latest improvements supported by this integration, which we hope will help in better collaboration and Incident Management.

Read Post

Squadcast

Read more about Announcing our improved Slack integration

PagerDuty Announces New Automation Enhancements That Simplify Operations Across Distributed and Zero Trust Environments

Mar 28, 2023 By Joseph Mandros In PagerDuty

Be sure to register for the launch webinar on Thursday, March 30th to learn more about the latest release from the PagerDuty Operations Cloud. Rundeck by PagerDuty has long helped organizations bridge operational silos and automate away IT tasks so teams can focus more time on building and less time putting out fires. And while this mission still rings true today, our vision is to extend this reality and revolutionize all operations while continuing to build trust.

Read Post

PagerDuty

Read more about PagerDuty Announces New Automation Enhancements That Simplify Operations Across Distributed and Zero Trust Environments

What Is MTTR?

Mar 28, 2023 By StatusCast In StatusCast

Mean Time To Repair, or MTTR, is a critical metric in IT incident management that measures the average time it takes to fix a system failure. The meaning of MTTR can be understood as the average duration needed for an IT team to recover from an incident. It is a fundamental metric for IT teams to track and analyze their efficiency in resolving incidents.

Read Post

StatusCast

Read more about What Is MTTR?

Bring Order to On-call Chaos With Splunk Incident Intelligence

Mar 27, 2023 By Annette Sheppard In Splunk

In today’s turbulent times, companies big and small are being pushed to do more with less. Budgets are getting tighter and companies are being pressured to serve customers who demand 24/7 availability from their applications and services. To meet these demands and remain competitive, enterprises are adopting cloud-first strategies and developing applications with microservice architectures.

Read Post

Splunk

Read more about Bring Order to On-call Chaos With Splunk Incident Intelligence

The Evolution of Incident Management from On-Call to SRE

Mar 24, 2023 By Vardhan NS In Squadcast

Incident Management has evolved considerably over the last couple of decades. Traditionally having been limited to just an on-call team and an alerting system, today it has evolved to include automated Incident Response combined with a complex set of SRE workflows.

Read Post

Squadcast

Read more about The Evolution of Incident Management from On-Call to SRE

How FireHydrant handled the SVB banking crisis

Mar 24, 2023 By Robert Ross In FireHydrant

On Thursday, March 9, 2023, something was afoot at our primary bank, SVB. By Friday, March 10, 2023, messages from our investors helped us quickly understand that FireHydrant needed to maneuver through a complex incident that was unfolding. Operational incidents are incidents like every other.

Read Post

FireHydrant

Read more about How FireHydrant handled the SVB banking crisis

Get data-driven executive communication out of the box with Reliability Insights

Mar 23, 2023 By Alex Greer In Blameless

Blameless’s comprehensive incident management platform is built to ease the burden of keeping your services up and running. Whether you are in the middle of an incident or trying to better track your response performance, you need access to your incident data on demand. Blameless’s Reliability Insights unifies your Incident, Resource, Task, and IAM data in a single customizable and queryable analytics tool.

Read Post

Blameless

Read more about Get data-driven executive communication out of the box with Reliability Insights

Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Development Pipeline: What should you consider?

Mean Time to Acknowledge (MTTA): What It Means & How To Improve MTTA

How to reduce mean time to act by tracing alerts with AIOps

Announcing our improved Slack integration

PagerDuty Announces New Automation Enhancements That Simplify Operations Across Distributed and Zero Trust Environments

What Is MTTR?

Bring Order to On-call Chaos With Splunk Incident Intelligence

The Evolution of Incident Management from On-Call to SRE

How FireHydrant handled the SVB banking crisis

Get data-driven executive communication out of the box with Reliability Insights

Monthly Archive

Follow Us