%term

The latest News and Information on Observabilty for complex systems and related technologies.

Multi-tiered Observability: A Practical Way to Handle Diverse Workloads

May 8, 2026 By Pablo Fernandez In VictoriaMetrics

Observability in large companies is rarely one-size-fits-all. The VictoriaMetrics topologies guide shows why different deployment patterns are needed as scale, isolation, and reliability requirements grow. Different workloads require different trade-offs: some need long retention for audits and trend analysis, while others need higher resolution for debugging. Business-critical systems also demand dependable alerting and high availability, often with several 9s of reliability.

Read Post

VictoriaMetrics

Read more about Multi-tiered Observability: A Practical Way to Handle Diverse Workloads

Why Blast Radius Analysis Does Not End When Alerts Fire

May 7, 2026 By Lightrun Team In Lightrun

Modern distributed systems fail in ways that can bypass even well-designed isolation patterns. When a failure is actively propagating across services at four in the morning, the question shifts from “how do we limit the blast radius” to “how do we confirm what it actually is.” Monitoring shows which services are in the impact zone, but it cannot show what code path caused the failure to spread, or whether it has stopped.

Read Post

Lightrun

Read more about Why Blast Radius Analysis Does Not End When Alerts Fire

Span or Attribute in OpenTelemetry Custom Instrumentation

May 7, 2026 By Jessica Kerr (Jessitron) In Honeycomb

TL;DR: Attribute. More information on one event gives us more correlation power. It’s also cheaper. When you want to add some information to your tracing telemetry, you could emit a log, create a span, or add a piece of data to your current span. Adding a piece of data to your current span is the best! Usually.

Read Post

Honeycomb

Read more about Span or Attribute in OpenTelemetry Custom Instrumentation

Observability and Security for the AI Era

May 7, 2026 By Datadog In Datadog

Datadog has always been driven by a broader vision of helping teams understand and operate complex systems. In this session, you’ll hear from Michael Whetten, Product SVP, and Abrar Hussain, Senior Director, Product Management, as they share the latest updates across the Datadog product suite and discuss how that vision continues to shape the platform’s evolution and support the next generation of AI-driven applications.

View Video

Datadog

Read more about Observability and Security for the AI Era

How to Prevent AI Agents From Deleting Production Data

May 6, 2026 By Lightrun Team In Lightrun

There’s a new question teams are asking. How can we prevent AI agents from deleting production. When Cursor deleted PocketOS’s entire production database in nine seconds, the agent wasn’t malfunctioning. It had full technical capability, but it was inferring operational authority from static code rather than live environment state. That gap between capability and context is the root cause. This article breaks down exactly how that happens, and what runtime visibility does to stop it.

Read Post

Lightrun

Read more about How to Prevent AI Agents From Deleting Production Data

The cost of knowledge

May 6, 2026 By Ofri Grushka In Coralogix

In the world of observability, “cardinality” has become a heavy word. It is a ghost used to justify skyrocketing bills or degraded query performance. When cardinality rises, the advice is almost always the same: reduce it. Drop your labels, or reduce the dimensions. It is usually framed as “optimization.” Every label you add to a metric is a dimension of knowledge. Each one gives you a way to slice, compare, and explain the chaos of production.

Read Post

Coralogix

Read more about The cost of knowledge

AI Factory Observability

May 5, 2026 By Virtana In Virtana

View Video

Virtana

Read more about AI Factory Observability

Introducing the Coralogix CLI: Headless Observability for Every Agent

May 5, 2026 By Coralogix Team In Coralogix

This article is a high-level overview of the Coralogix CLI. For a deeper look at how it works in practice, read the full technical deep dive here. Agent-driven investigation sounds simple: read the alert, query the data, return the cause. In reality, most agents either overload their context window with raw logs or guess at queries and return incorrect results.

Read Post

Coralogix

Read more about Introducing the Coralogix CLI: Headless Observability for Every Agent

The one pipeline change that cuts costs AND improves visibility

May 5, 2026 By Cribl In Cribl

Your SIEM and observability tools are expensive, and you're probably using them for things a data lake could handle for a fraction of the cost. Let's fix that.

View Video

Cribl

Read more about The one pipeline change that cuts costs AND improves visibility

Moving Beyond SolarWinds: A Guide to Modern Observability

May 5, 2026 By Sofia Burton In LogicMonitor

Industry-leading observability experts provide strategic guidance on why and how modern IT teams are successfully moving beyond SolarWinds to more resilient, cloud-native platforms. IT teams running SolarWinds often know the pain points well before they start evaluating alternatives: separate modules for different monitoring needs, a self-hosted deployment model that requires ongoing maintenance, and pricing that gets harder to predict after each acquisition.

Read Post