Operations | Monitoring | ITSM | DevOps | Cloud

Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

October 2019 Update: Improved usability and new emergency alarm triggering in the app

The October update of the mobile app includes the improvements described below and the user experience includes a new feature. The user interface of the app has been improved so that you can now decide for yourself if you want to see all Signls on the dashboard differentiated by categories. Using the “More” button on the dashboard’s Signl widget, you can now open a context menu and show or hide the display of the categories.

What is MTTD? Mean Time to Detect Explained In Detail

This post will answer a simple question, “What is MTTD?” The answer—or at least the start of it—was already spoiled by the post title. Sure enough, MTTD stands for “Mean time to detect.” It refers to an important KPI (key performance indicator) in DevOps. Is the question answered? Can we call it a day with that definition? Of course not.

Blameless Culture Key to Addressing Outage Outrage in Australia

After the unfortunate Commonwealth Bank of Australia outage last week, the powerful Payment Systems Board—whose members include the chairs of the RBA and APRA – announced it would make all outage data public to prevent banks, payment schemes, and telecommunications carriers from “hiding behind” the performance statistics shared by each institution.

Service Monitoring and You

Monitoring is an art form. That sounds cheesy and lazy, but the right kind of monitoring is very context-dependent and rarely does the same practice work across multiple pieces of software or people. This gets even harder when you think about modern software architectures. Microservices? Container schedulers? Autoscaling groups? Serverless? ${New-technology-that-will-solve-all-of-my-problems-but-probably-creates-other-problems}?

Tips for modern NOCs - simplifying IT operations team reporting

The dreaded monthly operations and application meeting is edging near. You know which meeting we’re talking about: the one where all the Ops teams get together to talk about what they are seeing, about application enhancements and modifications, potential improvements, and often – about their frustrations!

Happy National Cybersecurity Awareness Month!

October is the month of spooky scares, so it makes sense that National Cybersecurity Awareness Month is also recognized at this time—after all, what’s more scary than, for example, having someone phish for your personal information and using said info to ruin your credit or losing your password to hackers so they have access to your bank account?

Modernizing Your Digital Operations with Sumo Logic and PagerDuty

As digital transformation continues to be central to an organization’s growth mandate, it’s critical to ensure that customer-facing, revenue-generating, mission-critical applications are operationally reliable and secure. That’s where Sumo Logic comes in—for almost 10 years, we have been providing a Continuous Intelligence platform for DevSecOps that’s utilized by over 2000+ customers in almost every vertical.