Operations | Monitoring | ITSM | DevOps | Cloud


Ask the Experts: Observability: What Can the Frontend Steal From the Backend?

What is the biggest value of #observability as practiced on the #backend that you are excited to see taken up as more #frontend #developers start practicing observability on their own? Featuring: Winston Hearn, Frontend Observability Expert and Hazel Weakly, Web Developer and #SRE.

Ask the Experts: Distributed Tracing, OpenTelemetry, and Connecting Your Frontend to Your Backend

While baggage isn’t required for distributed tracing, it is required for carrying metadata between services. How will the observability community address that and make it easier over time? Featuring: Winston Hearn, Frontend Observability Expert and Hazel Weakly, Web Developer and SRE.

Investigating Mysterious Kafka Broker I/O When Using Confluent Tiered Storage

Earlier this year, we upgraded from Confluent Platform 7.0.10 to 7.6.0. While the upgrade went smoothly, there was one thing that was different from previous upgrades: due to changes in the metadata format for Confluent’s Tiered Storage feature, all of our tiered storage metadata files had to be converted to a newer format.

Independent, Involved, Informed, and Informative: The Characteristics of a CoPE

As our Field CTO Liz Fong-Jones says, production excellence is important for cloud-native software organizations because it ensures a safe, reliable, and sustainable system for an organization’s customers and employees. A CoPE helps organizations cultivate the practices and tools necessary to achieve that consistently. In part one of our CoPE series, we analogized the CoPE with safety departments.

Virtualizing Our Storage Engine

Our storage engine, affectionately known as Retriever, has served us faithfully since the earliest days of Honeycomb. It’s a tool that writes data to disk and reads it back in a way that’s optimized for the time series-based queries our UI and API makes. Its architecture has remained mostly stable through some major shifts in the surrounding system it supports, notably including our 2021 implementation of a new data model for environments and services.

Empowering Engineering Excellence: Achieving a 26% Reduction in On-call Pages at Amperity with Modern Observability for Logs

Amperity required an observability partner to facilitate their transition into the modern engineering era as their previous tooling struggled to support their growth strategy. When customer data is scattered everywhere, how do you put the pieces together to get an accurate customer 360° view? That’s the power of Amperity’s customer data platform (CDP), and the company has been driving customer data innovation for nearly a decade.

The Cost Crisis in Metrics Tooling

In my February 2024 piece The Cost Crisis in Observability Tooling, I explained why the cost of tools built atop the three pillars of metrics, logs, and traces—observability 1.0 tooling—is not only soaring at a rate many times higher than your traffic increases, but has also become radically disconnected from the value those tools can deliver. Too often, as costs go up, the value you derive from these tools declines.