Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Observabilty for complex systems and related technologies.

Tech Talk - Splunk Observability for AI

In this Tech Talk, we’ll show you how Splunk’s agentic, AI observability delivers end-to-end visibility of the entire AI stack, from agents and large language models (LLMs) to the underlying infrastructure. You’ll see how AI Infrastructure Monitoring provides teams with data-dense dashboards and detectors for surfacing trends, patterns, and outliers to correlate application health with underlying AI infrastructure performance.

Reporting Exceptions to Honeycomb with Frontend Observability

So you've built a client application and you've started sending telemetry. The information sent back by this client is vital to you, and one of the first things you care about is capturing and reporting errors. There are at least two ways to report error details in OpenTelemetry. Web applications generally place exceptions in trace spans as span events, and mobile applications send exceptions as log messages instead.

Save the logs, save the planet: How to make your observability stack greener

If data centres were a country, they’d rank fifth in electricity consumption by 2026. Over the past few years, the resulting carbon footprint of the technology industry has sparked the fast-growing green software movement, led by the Green Software Foundation. How can we continue to innovate software in a way that also minimises its impact on the environment? This has been a fascinating problem I’ve been exploring for a few years now.

AI Observability in 2026: Why the data layer means everything

If there was ever a year for AI observability, it was 2025. Vendors released assistants to cover a variety of use cases. Coralogix released the first agent (distinct from assistants!), Olly, an autonomous, multi-agent observability platform. The direction of travel is clear, but many vendors and users are about to run into some significant problems with their data layer.

Top OpenTelemetry Backends for Storage & Visualization

OpenTelemetry backends provide storage, analysis, and visualization for telemetry data (traces, metrics, logs). This guide lists available OpenTelemetry-compliant backend options, categorized by use case: APM platforms, storage backends, visualization tools, and distributed tracing systems. For detailed comparison, see OpenTelemetry Backend Comparison.

How AI Agents automate incident response #ai #cybersecurity #telemetry

Clint Sharp demonstrates how Cribl Search leverages AI to streamline incident investigation. Starting from a Slack channel, the AI builds an interactive notebook, analyzes order processing logs, and identifies suspicious traffic spikes. It connects high CPU usage to a recent Jenkins deployment, hypothesizing a supply chain attack, and ultimately recommends a rollback. This isn't a far off concept. It is the future of operations arriving right now.