Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Black Swans and Grey Rhinos - Observations on Coronavirus and IT Ops During Crisis

As the Coronavirus crisis unfolds and all of us struggle to understand its implications and to adapt, many thoughts come to mind on many different levels – personal, business related, philosophical. This event is definitely a game changer, in the near future for sure – and many say in the long run as well.

How We Use Blameless to Power Remote Work

As with all other companies, the Blameless team is adapting to a world of remote work where distributed teams will need to get better than ever at staying aligned and efficient. We’ve been relying on Blameless more and more to improve how we collaborate virtually. Here are some of the top workflows and tips on how we have been using Blameless internally to streamline remote productivity.

5 ways you can Empower Your Remote IT Team

The coronavirus (COVID-19) pandemic has transformed the global workforce, and your enterprise is no exception. Enterprises are increasingly allowing staff to work remotely during this difficult time. Yet many enterprises are still learning how to use remote workforce technology to engage workers effectively in real time. AlertOps can connect your teams, regardless of location. It offers several features to power your remote workforce.

Moogsoft Enterprise 8.0 Redefines the Virtual NOC

Moogsoft Enterprise consolidates visibility and control of monitoring tools to help entire IT Ops and DevOps teams reduce noise, prioritize incidents, reduce escalations and ensure uptime. Working from anywhere, users can easily find and resolve the root cause of incidents before they become outages.

Moogsoft and PagerDuty: Boosting DevOps Teams' Productivity and Incident Resolution with AIOps

Today, the customer experience drives IT on all levels. In our digitally transformed world, we do everything online — transact, interact, purchase and more. This mandates constant change and zero downtime. Ironically, as enterprises adopt IT innovations, IT environments get harder to manage and impact the productivity and agility of DevOps and SRE teams — and as a result, the customer experience suffers.

Moogsoft and Atlassian JIRA and Opsgenie: Put the Dev and Ops in DevOps!

DevOps has become the go-to approach for IT teams to meet business requirements faster and ensure the quality of the customer experience. Today’s economy and the customer experience drive IT across the entire stack. Our world has become digitally dependent, which mandates an ever-evolving IT environment that’s on-demand and always available.

A "Retrospective" of Amy Tobey's "The Future of DevOps is Resilience Engineering"

On April 22, 2020, at 11:20 AM PST, Amy Tobey began her talk “The Future of DevOps is Resilience Engineering” at Gremlin’s Failover Conf. The talk used key concepts from DevOps as a way to understand resilience engineering. Amy began by having the audience take part in a group breathing exercise of 3 deep breaths, then spoke about the yoga practice of pranayama as a way to understand DevOps.

Sending Azure Monitor outage notifications to Microsoft Teams

Microsoft Azure is a cloud computing service that provides infrastructure as a service (IaaS), software as a service (SaaS) and platform as a service (PaaS). It supports multiple Microsoft-specific and third-party services and systems, has 90+ compliance offerings, and is trusted by 95% of Fortune 500 companies to run their business. What is system downtime and how does it affect me or my business?