Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

5 ways teams used BigPanda during the CrowdStrike outage

In the weeks since the Crowdstrike outage brought millions of systems to a halt, countless articles have been written about the cause of the outage, its impact, and the costs companies incur during service disruptions. Nearly every large company had hosts offline due to the faulty update in CrowdStrike’s Falcon software. BigPanda customers were no exception. On July 19, between 04:00 and 07:00 UTC, the BigPanda systems logged an increase in shared incidents.

Alert noise reduction: How to cut through the noise

ITOps and AIOps teams often face an overwhelming volume of notifications, many of which are false positives or low-priority alerts. The constant influx creates a chaotic environment. ITOps and AIOps teams can easily miss critical issues, potentially leading to system failures or prolonged downtime. Spending significant time sifting through irrelevant alerts reduces team efficiency and slows response. Focus on alert noise reduction to ensure that only meaningful and actionable alerts reach your teams.

Navigating the Incident Management Lifecycle: A Complete Guide

Ever wonder why some IT teams can quickly resolve incidents while others struggle? The secret lies in mastering the Incident Management lifecycle. But don’t worry—this isn’t some dull, complicated process only experts can understand. The Incident Management lifecycle is simply a structured approach to handling incidents efficiently. And the best part? You can quickly get the hang of it.

What is ISO 27001 Incident Management? Definition and Process

Managing incidents is crucial to maintaining the security and integrity of an organization's information systems. ISO 27001 Incident Management provides a structured approach to addressing and resolving incidents in a way that minimizes impact and prevents recurrence. This framework doesn't just help organizations respond to incidents—it helps them create a robust system that anticipates and mitigates risks before they escalate.

Grafana Tempo 2.6 release: performance improvements and new TraceQL features

Grafana Tempo 2.6 is here with performance improvements and buckets of new TraceQL features! Watch the video above for an overview of the new TraceQL features, or continue reading to get a quick overview of the latest updates in Tempo. If you’re looking for something more in-depth, don’t hesitate to jump into the Grafana Tempo 2.6 release notes or the changelog.

Elastic extends Express Migration program for Splunk logging customers

Observability is undergoing a massive shift as enterprises drive adoption of modern technologies, including cloud and microservices, along with disruptive technologies, such as generative AI (GenAI). To keep pace with the complex requirements of the modern tech stack, operations teams need to consider and adopt next-generation observability. Splunk users are often challenged by using products that provide fragmented observability, hampering their ability to modernize their environments.

Monthly Recap: August 2024 + Looking Ahead to September

August has been an outstanding month for the Cycle team, marked by significant advancements across the platform. We've introduced new features aimed at simplifying workflows and enhancing user experience. As we wrap up August and look ahead to September, we're excited to share the progress we've made and what you can expect in the coming weeks.

Enhancing Operational Efficiency with DX NetOps Integrations

In many organizations, network teams are experiencing a significant skills shortage. The network operations center (NOC) requires expertise in various emerging technologies, which makes it increasingly challenging to find qualified candidates with the right skills. A recent survey revealed that in 2022, only 26% of companies found it somewhat to very difficult to hire networking professionals. By 2024, this figure had risen to 41%.