Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Trends in Mainframe Modernization: Fresh Insights from SHARE Orlando

Fresh insights from SHARE Orlando reveal mainframe modernization isn't about replacement—it's evolution. From hybrid architectures to AI-driven automation, enterprises are transforming legacy systems into agile, integrated platforms while preserving core reliability.

Observability for Azure Virtual Desktop with SquaredUp

Managing Azure Virtual Desktop doesn’t have to mean jumping between portal blades, logs, and metrics trying to piece together what’s happening. In this webinar, you’ll learn how to design and implement a single, operational observability dashboard for Azure Virtual Desktop (AVD) using SquaredUp Cloud — transforming fragmented telemetry into clear, actionable insight. Whether you're responsible for performance, user experience, or operational stability, this session will give you a structured, repeatable framework for monitoring your AVD estate with confidence.

Datadog Incident Response: One platform from alert to resolution

When incidents strike, speed and clarity are critical. Datadog Incident Response brings the full incident lifecycle into one platform so teams can move from detection to resolution with confidence. Operate from a single, unified view of your systems, coordinate across the tools your teams already use, and leverage AI that analyzes incidents in real time to surface context, guide decisions, and accelerate resolution.

What is Agentic Observability?

Agentic observability is the instrumentation and correlation needed to explain and control agent behavior across multi-step workflows. Legacy observability focuses on runtime health and service behavior. You monitor metrics like CPU usage, memory, latency, and error rates to confirm that applications and infrastructure are functioning as expected. When a workflow degrades, the proximate cause is often a crash, timeout, permission error, or resource constraint.

How Autonomous Are Your IT Operations, Really?

This post introduces a six-level maturity model that defines what true autonomy looks like in IT operations, from basic AI chat interfaces to fully coordinated agent ecosystems. ITOps teams have more automation tooling than ever, and yet incident response still depends heavily on human judgment to hold it together. Alerts fire, engineers dig through dashboards, context gets assembled by hand, and someone at the end of the workflow makes the final call.

Best Rails APM Tools in 2026: A Developer's Guide

Rails applications have a specific set of performance challenges that make monitoring genuinely useful rather than just box-checking. ActiveRecord is convenient to use and also convenient to accidentally write N+1 queries with. Memory bloat in long-running processes, particularly when Sidekiq or Action Cable is involved, is a recurring production problem for a lot of teams. Background job performance tends to degrade quietly until it becomes noticeable.

Accelerate Vulnerability Remediation with Atatus: From Detection to Secure Deployment

In microservices and cloud-native environments, vulnerabilities buried in transitive dependencies or runtime behaviors can go undetected for weeks. During that time, your attack surface keeps expanding and production systems remain exposed. The longer remediation is delayed, the greater the risk of exploitation, compliance failures, and operational disruption.

Episode 6 - The evolution from automation to autonomy

Tom and Akhilesh unpack why automation alone will never deliver autonomy, and why intelligence means anticipating change rather than constantly reacting to it. They explore the role of people in enterprise transformation, the limits of technology without trust and context, and why the most powerful use of AI is freeing humans to focus on what they do best. Plus, Akhilesh makes the case for ping pong as a surprisingly effective way to reset when the pressure is on.