The latest news and information on observability for complex systems and related technologies.

Harnessing Network Observability to Enhance Grid Resilience

Within the utility sector, a lot is changing. Utilities continue to pursue digital transformation, altering the way services are delivered and operations are managed. What hasn’t changed is the criticality of the services provided. These organizations deliver essential resources like natural gas, electricity, and water—services that we as consumers rely upon constantly for our comfort, sustenance, communications, and more.

Inside the Observability Journey: Lessons from CarGurus, Nearform & More

Join us for a dynamic panel from Observability Sessions Boston where leaders from CarGurus, Nearform, and Grafana Labs share their real-world experiences with observability. In this candid discussion, David Frankel (CarGurus) and Joe Szodfridt (Nearform) delve into the challenges of implementing scalable observability practices, moving from centralized models to federated teams, and navigating cloud migration with a focus on performance and cost.

Using the OpenTelemetry Operator to boost your observability

If you’ve ever wrangled sidecars or sprinkled instrumentation code just to get basic trace data, you know the setup overhead isn’t always worth the payoff. But what if it was… just easier? That’s where the OpenTelemetry Operator for Kubernetes steps in… and it plays great with Coralogix out of the box!

How to implement business observability

It sounds simple: You define metrics for success, you track them, and if they fail, you fix them. For decades, this was how businesses monitored their systems. However, a reactive monitoring approach, which alerts businesses about failures only after the issue has already impacted operations, became insufficient as digital architectures grew more complex.

Observability 2.0 in the Real World: Lessons from SimpliSafe's Engineering Journey

In this candid and insightful talk from Observability Sessions Boston, Laban Eilers, a platform engineer at SimpliSafe, takes us on a practical deep dive into the evolution of observability—from the traditional “three pillars” model to the emerging promise of Observability 2.0.

Is There an Existential Crisis in Network Observability?

We've all been there. Users report that applications are slow, calls are dropping, or that "the internet is broken." Yet, a glance at the network dashboards shows a sea of green—latency looks acceptable, packet loss is minimal, and bandwidth seems fine. This common scenario highlights a fundamental challenge in network observability: the perceived disconnect between the technical measurements we gather and the actual experience of the people using our digital services.

Sneak Peek: MetricFire's New Logging Tool for Scalable, Open-Source Observability

Take a first look at MetricFire’s brand-new logging tool — designed to simplify log ingestion, storage, and visualization using open-source components like Loki, Python, Telegraf and Grok. Collect logs, search across services, and correlate them with your metrics — all inside your existing Hosted Graphite environment. Whether you're an SRE, DevOps engineer, or running logs on a budget, this sneak peek reveals how MetricFire is evolving toward full observability.

Logz.io AI Agents: Transforming Observability Through Intelligent Automation

Let’s be honest. AI features can sound cool on paper, but too many tools overpromise and underdeliver. At Logz.io, we didn’t want to build “yet another AI chatbot.” We wanted to create something our engineers and yours would actually use when incidents hit, logs explode, or someone asks, “What just happened to production?” Here’s how our AI Agent evolved from a basic chat interface to an incident-resolving, log-analyzing, doc-digging, context-aware assistant.