
The latest news and information on observability for complex systems and related technologies.

Approaching your observability migration with the right mindset

This guest blog post is authored by Nick Vecellio, Principal Engineer and Co-founder of NoBS, a Premier Datadog Partner specializing in hands-on Datadog migrations and optimizations. At NoBS, we help enterprises migrate their observability stack to Datadog. Teams often come to us after a migration has technically “worked,” but the new setup requires optimization tweaks to provide the clarity, reliability, or operational benefits they’re looking for.

What is Agentic Observability?

Agentic observability is the instrumentation and correlation needed to explain and control agent behavior across multi-step workflows. Legacy observability focuses on runtime health and service behavior. You monitor metrics like CPU usage, memory, latency, and error rates to confirm that applications and infrastructure are functioning as expected. When a workflow degrades, the proximate cause is often a crash, timeout, permission error, or resource constraint.

Top 12 AI and LLM Observability Tools in 2026 Compared: Open-Source and Paid

Artificial intelligence has moved far beyond experimentation. In 2026, AI systems are embedded into customer support workflows, clinical decision support tools, fraud detection engines, and internal copilots across nearly every industry. Adoption is accelerating quickly. According to McKinsey, 23% of organizations are already scaling agentic AI systems, while another 39% are actively experimenting with them. Yet the path to reliable production AI remains uncertain.

Observability for Azure Virtual Desktop with SquaredUp

Managing Azure Virtual Desktop doesn’t have to mean jumping between portal blades, logs, and metrics trying to piece together what’s happening. In this webinar, you’ll learn how to design and implement a single, operational observability dashboard for Azure Virtual Desktop (AVD) using SquaredUp Cloud — transforming fragmented telemetry into clear, actionable insight. Whether you're responsible for performance, user experience, or operational stability, this session will give you a structured, repeatable framework for monitoring your AVD estate with confidence.

Full-Stack Observability Is Becoming a Business Imperative

As enterprises accelerate digital transformation, technology performance has become inseparable from business performance. Customer experiences, revenue streams, and operational efficiency increasingly depend on the reliability of complex, distributed systems. In this environment, full-stack observability is no longer a technical aspiration — it is a strategic necessity.

Your Questions About AI-Assisted Development Answered

We recently hosted a webinar on AI-assisted development with DORA, and the audience had a lot of questions—far more than we could get to in an hour. I picked out six that get at the stuff people are wrestling with day to day. These aren't the easy questions, and I don't think there are necessarily easy answers, but I've spent the past year building and shipping with AI coding tools and observing (literally) what happens when that code hits production. Here's what I have.

Public Sector Observability: Service Experience and Reliability Are Now Mission-Critical

Reliable digital services aren’t optional for public sector agencies. They’re essential to mission success. Across the U.S. public sector, service experience and reliability have moved from operational concerns to mission requirements. At a federal level, Executive Order 14058 makes improving service delivery and customer experience a federal priority, measured by real outcomes for the public. And for state and local governments, the bar is set by the private sector.

Centralizing Docker Logs for Observability and Security

Most people can remember the old game of telephone, the stream of whispered sentences or phrases across a group of kids. At each transmission, a different piece of information gets lost or misheard, leaving the last person with an incomplete or incomprehensible statement. Managing Docker logs can feel the same way, especially when an error message is lost or arrives without context.

5 Essential Capabilities that Make Coralogix an Observability Powerhouse

Sometimes observability can feel like a second job. With many traditional tools, users must become experts in a proprietary language to ask a simple question. In these cases, developers or SREs can find themselves spending more time manually sifting through raw text, building complex data pipelines from scratch, and bouncing between fragmented dashboards than actually solving problems.