The latest news and information on observability for complex systems and related technologies.

Why AI Will Push #Telemetry Budgets to the Breaking Point in 2026

Telemetry growth is about to hit a new level in 2026. Nick Heudecker from Cribl walks through our new predictions report and explains why observability costs are set to surge again, with more than a third of enterprises spending at least 15% of their IT budgets on telemetry alone. He also shares how agentic AI adds new risk to the data pipeline, why most AI workloads will struggle to scale, and how platform shifts and market forces will reshape the data landscape.

#AI Powered Data Protection Inside Cribl Guard

Cribl Guard uses an always-running AI agent to spot sensitive data as it moves through your environment and recommend the right protections in real time. In this demo, you will see how the agent samples live events, identifies patterns like credentials and credit cards, and turns them into one-click fixes that keep your destinations safe. Faster detection, smarter rule recommendations, and instant mitigation. This is what modern data protection looks like.

Our latest updates across the VictoriaMetrics Observability ecosystem

We’re excited to announce a set of updates across the entire VictoriaMetrics open source product suite, including VictoriaMetrics, VictoriaLogs, VictoriaTraces, and the VictoriaMetrics Kubernetes Operator. These improvements bring better performance, stronger security, enhanced metadata visibility, and a smoother experience when running observability at scale.

Honeycomb Frontend Observability - See Everything

In this video we take a tour through Honeycomb's Frontend Observability offerings for Web and Mobile. We see how the launchpads can help spot performance errors, how the error investigations feature makes it easy to trace frontend errors all the way to their cause in backend services, and how easy it is to find differences between traces across various devices.

Stop Leaking PII in Your #Telemetry with Cribl Guard

Sensitive data sneaks into destinations more often than teams realize. In this clip, we capture live events, spot emails and login tokens slipping through, and fix it instantly with Cribl Guard. A few clicks, a commit and deploy, and Guard redacts the data in real time. No complex configs. No regex nightmares. Just fast protection that keeps your telemetry clean and your security tight.

Pastries with SREs: Holding onto extra observability data and desserts

In this episode of Pastries with SREs, we dig into why you should keep all of your observability data, even if you don’t need it quite yet. With enriched logs and flexible, cost-effective storage, you can stop worrying about what you might need later and start answering questions with confidence, no matter when they arise.

How to Reduce Your Cloud Costs with Coroot

Cloud costs often grow quietly until they suddenly command everyone’s attention. Gartner estimates that companies overspend on cloud services by up to 70 percent, mostly because they lack clear visibility into where the money is actually being spent. Cloud invoices speak the language of infrastructure: nodes, instance types, regions, volumes, and egress. Engineering teams speak the language of services, deployments, and code.

AI and DevOps in 2025: How Autonomous Engineering Will Transform Software Operations and Reliability

DevOps started as a way to break down barriers between development and operations, but by 2025 the movement has shifted into something far more ambitious. Instead of simply speeding up releases or tightening workflows, companies are now adopting autonomous engineering systems: tools powered by AI that don't just support DevOps practices but actually carry them out.

Side-by-Side Variable Comparison for Snapshot Debugging

When you’re debugging a tricky issue in a distributed system, “what changed?” is often the most important question. You add logs, you capture data, you redeploy, and suddenly your browser is full of open tabs, copied JSON blobs, and screenshots of log lines. Comparing behavior between two requests, two users, or two releases turns into a manual, error-prone chore. Lightrun Snapshots were built to fix the data collection side of that story.

What's Special About MCP?

AI agents can interact with the world using tools. Those tools can be generic or specific. The most general ones, like “run a bash command” and “read and write files”, are built into the agent. More specific ones are provided through Model Context Protocol (MCP) servers. Every tool provided to the agent comes with instructions sent as part of the context.
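
As a rough illustration of how a tool's instructions travel with the context, here is a minimal Python sketch. It is not taken from the article: the tool names (`run_bash`, `query_traces`) and the `render_tool_context` helper are hypothetical. Only the `name`, `description`, and `inputSchema` fields follow the shape MCP servers use when listing their tools.

```python
import json

# Hypothetical tool definitions for illustration only. The field names
# (name, description, inputSchema) mirror how MCP servers describe tools.
TOOLS = [
    {
        # A generic tool of the kind usually built into the agent.
        "name": "run_bash",
        "description": "Run a bash command and return its output.",
        "inputSchema": {
            "type": "object",
            "properties": {"command": {"type": "string"}},
            "required": ["command"],
        },
    },
    {
        # A more specific tool of the kind an MCP server might provide.
        "name": "query_traces",
        "description": "Search recent traces for a given service name.",
        "inputSchema": {
            "type": "object",
            "properties": {"service": {"type": "string"}},
            "required": ["service"],
        },
    },
]


def render_tool_context(tools):
    """Render tool definitions as text to include in the agent's context."""
    sections = []
    for tool in tools:
        schema = json.dumps(tool["inputSchema"], sort_keys=True)
        sections.append(
            f"Tool: {tool['name']}\n"
            f"Instructions: {tool['description']}\n"
            f"Input schema: {schema}"
        )
    return "\n\n".join(sections)


# Every tool adds its own instructions to the context the model reads.
context = render_tool_context(TOOLS)
```

In this sketch each extra tool lengthens the context, which is one reason the choice between generic and specific tools matters.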