The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Incident Resolution for Remote Teams

People working in IT support and incident management right now are faced with unusual difficulties supporting large remote workforces and managing unpredictable workloads. On Reddit, system admins and other IT pros are bemoaning the hiccups and hassles of working in isolation while trying to resolve issues and maintain high SLAs. You can’t go grab your indispensable SME for troubleshooting, because that person is also home and inundated with messages and alerts from many different tools.

Configure an Intuitive Service Dashboard & Reduce Response Time

Leverage multiple alert sources in Squadcast to reflect your actual system infrastructure on your Service Dashboard. Having your incident management tool reflect your system architecture is a big milestone in reducing cognitive load on your on-call team. To help our users move one step closer to this milestone, we recently released the ability to add multiple alert sources to a service. You can now model your service dashboard to mirror your actual system architecture.

Take huge leaps with Honeycomb for Incident Response

As engineering teams shift from delivering services on monolithic architectures to microservices and even serverless environments, developers are no longer just responsible for creating and maintaining their code. Shared ownership has become the new normal (or is at least trending that way), so developers are now responding to production incidents and, in some cases, joining the on-call rotation. Incidents vary in impact, but they all take time away from innovation and creating new capabilities.

Black Swans and Grey Rhinos - Observations on Coronavirus and IT Ops During Crisis

As the Coronavirus crisis unfolds and all of us struggle to understand its implications and to adapt, many thoughts come to mind on many different levels – personal, business related, philosophical. This event is definitely a game changer, in the near future for sure – and many say in the long run as well.

Moogsoft and PagerDuty: Boosting DevOps Teams' Productivity and Incident Resolution with AIOps

Today, the customer experience drives IT on all levels. In our digitally transformed world, we do everything online — transact, interact, purchase and more. This mandates constant change and zero downtime. Ironically, as enterprises adopt IT innovations, IT environments get harder to manage and impact the productivity and agility of DevOps and SRE teams — and as a result, the customer experience suffers.

Moogsoft and Atlassian JIRA and Opsgenie: Put the Dev and Ops in DevOps!

DevOps has become the go-to approach for IT to accelerate its ability to meet business requirements and ensure the quality of the customer experience. Today’s economy and the customer experience drive IT across the entire stack. Our world has become digitally dependent, which mandates an ever-evolving IT environment that’s on-demand and always available.