Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Observabilty for complex systems and related technologies.

Unified observability for Alibaba Cloud with Datadog

Alibaba Cloud is a major cloud provider in APAC, offering industry-leading foundational AI models in addition to compute, managed databases, object storage, and Kubernetes through its Container Service for Kubernetes (ACK). Teams choose Alibaba Cloud for its infrastructure availability across Asia Pacific and its managed services. For SREs and platform engineers, that often means running Alibaba Cloud alongside AWS, Google Cloud, or Microsoft Azure.

The Kubeshark Workflow That Doesn't Stop at the Dashboard

The Observability Gap shows up the moment you try to reproduce a production bug locally. Your traces tell you a request was slow. Your logs tell you which line printed. Neither tells you what was actually on the wire: the headers, the JSON body, the surprise field your client started sending last Tuesday. Until now, closing that gap meant SSHing to a node, attaching a debugger, or shipping a sidecar through change review.

What is AI-Powered Observability? A Complete Guide for IT Teams in 2026

Is your monitoring stack really giving you clarity, or just more alerts? Your monitoring stack is probably working exactly as designed. That is the problem. As systems grow, most IT and platform teams start to see the same patterns: At this point, traditional monitoring starts to feel limited. This is where teams begin exploring AI in observability. In this guide, we will explain what AI-powered observability actually means, how it works, and when it is useful.

Everything We Talked About at O11yCon 2026

We just wrapped O11yCon 2026, and this year's conversations hit differently. Agent-based software development is here, now. It's no longer an optional choice, and everybody is struggling to understand what their agents are doing and how to make them cost less and perform better. Over the course of fifteen talks, we saw clearly that the old assumptions on how and who (or what) writes our software has been upended. Here are some highlights. We'll have videos available in the near future.

Search Azure Blob data in-place with BYOS for Cribl Lake

See how Bring Your Own Storage (BYOS) in Cribl Lake allows teams to connect directly to Azure Blob Storage and instantly search data in place — without moving, duplicating, or rehydrating telemetry. In this demo, Cribl Product Manager Risk Salsa walks through setup, dataset creation, and how to run fast investigations across your Azure-hosted data using Cribl Search.

Observability Expanding Beyond Infrastructure and Into AI Systems

Observability revolves essentially around understanding infrastructure health. This means that operations teams monitor applications, netwo0rks, database and cloud environments using familiar signals. They use logs, metrics, latency, uptime measurements, and traces. If systems remain available and the performance stays within expected thresholds, the teams have enough visibility to understand whether applications are functioning properly.

Inside the Grafana AI Team Weekly: Guards for AI Observability (May 5, 2026)

This is an excerpt from a real AI team weekly meeting where we talk about the stuff we build and occasionally also demo them! In this one, Principal Software Engineer Sven Großmann shows a new feature he's working on for AI Observability, called "guards". We're showing parts of our team meetings to build in public in some small way and give you a sneak preview of what's to come. But not all features we show may make it to production! You've been warned. :)

Your Microsoft Azure storage, our data lake power: The best of both worlds

The wait is over for Azure-first organizations. Cribl just launched Cribl Lake Bring Your Own Storage (BYOS) for Microsoft Azure, giving you full data lake power without moving a single byte of telemetry out of your environment. Join us to see how you can finally get the flexibility of a modern data lake while keeping your data in Azure.

Why Traditional Observability Breaks Down in Hybrid Cloud Environments

Hybrid cloud has reshaped the way enterprises build, run, and troubleshoot digital services. Applications now stretch across on-premises infrastructure, cloud platforms, regional services, interconnects, and distributed dependencies that change constantly. Operational complexity has expanded with that footprint, yet many observability practices still reflect assumptions from an earlier era of simpler architectures and clearer boundaries. That gap shows up fast during an incident.

The Complete Guide to Observability Pipelines

Modern engineering teams are drowning in telemetry data. A mid-sized Kubernetes cluster running 50 microservices can generate millions of log lines per minute. Add distributed traces, Prometheus metrics, cloud provider events, and application-level instrumentation and you're looking at terabytes of observability data every day. The problem isn't just volume. It's what you do with it.