The latest news and information on observability for complex systems and related technologies.

MCP Observability with OpenTelemetry

2025 has truly been the year of Agentic AI, with MCP (Model Context Protocol) emerging as one of its flashiest and most talked-about innovations. While many products have seamlessly integrated MCP servers into their systems, these servers are increasingly being labelled as black boxes: opaque components that handle critical tasks but offer little visibility into what's happening under the hood. We prompt an agent, a tool gets invoked, and a response is generated. But what really happens in between?

Can Claude Code Observe Its Own Code?

One of the great things about OpenTelemetry is that it’s a standard, and standards tend to proliferate. I was excited to see Claude Code add OpenTelemetry metric and log support in a recent release. What was really interesting—beyond the ability to capture usage data from Claude Code—is that you can also get pretty detailed logs about what you’re doing with Claude Code.

Why GovRAMP-authorized observability matters for state, local, and education IT teams

Building on our FedRAMP Moderate authorization and our “In Process” status for FedRAMP High, Datadog for Government is now “In Process” for GovRAMP High Authorization, giving agencies a unified observability platform that meets the toughest public-sector security bars.

Operational Intelligence - the new horizon of observability

Monitoring your systems isn't enough anymore. Neither is “asking questions about your system”. Operational Intelligence embraces observability to proactively deliver business insights, support decision-making, and accelerate innovation. It seems that as the observability market grows and more and more products come into the space, the meaning of the term observability itself becomes more and more nebulous.

A Quick Guide To Kubernetes Observability

Many companies are rapidly adopting cloud-native computing services, like containers, microservices, and serverless computing. Unlike monolithic applications, these technologies rely on distributed architectures. Whether you are running them in the cloud, on-premises, or both, distributed systems consist of thousands or millions of processes and components. The challenge now is to make these complex systems’ inner workings visible, controllable, and improvable.

The Open Source Observability Podcast - EP #1: ClickHouse, Data Lakes, and AWS S3 with Joshua Lee

In this episode we get to dive into some of Josh's favourite databases and telemetry sources for observability. Listen to learn what open source software you could benefit from including in your toolstack! Joshua Lee is a Developer Advocate at Altinity, where he applies his observability and engineering background to ClickHouse use cases and creates educational content to support the open source community. He has over 15 years of experience leading software projects across a broad range of industries.

From Detection to Resolution: How Selector + Itential Deliver AI-Driven Observability and Automated Recovery

Every second counts when it comes to detecting, diagnosing, and resolving network incidents, yet many teams still find themselves stuck in reactive mode, drowning in alerts, manually writing scripts, and managing tickets across disconnected systems. This is where Selector and Itential come in. Together, Selector and Itential deliver a powerful, enterprise-ready solution that closes the loop between detection and action.

Can AI/ML Guide Observability? Tech Talk #6

This talk will examine the application of Artificial Intelligence and Machine Learning in observability. It will cover how AI/ML is being used to monitor systems, detect anomalies, and extract insights from telemetry data. The session will also explain how to integrate AI/ML into observability pipelines to improve analytical capabilities and system performance.