Operations | Monitoring | ITSM | DevOps | Cloud

Adaptive Metrics in Action: How The Trade Desk Optimized Observability Costs | Grafana Labs

Managing observability costs at scale is no easy task — especially when metrics volume grows fast. In this talk, Paul Givens, Head of Observability at The Trade Desk, shares how they implemented Adaptive Metrics to control costs without sacrificing visibility. How Adaptive Metrics works to reduce cardinality and cost Real-world implementation lessons from a high-scale AdTech environment Key takeaways for teams managing large Prometheus-like metric sets.

Kubernetes Monitoring: One view for observing all your storage volumes

If you want to observe your entire Kubernetes environment, you need visibility into all of your resources, including storage volumes. But monitoring Kubernetes storage hasn’t always been easy, especially if you wanted to see how it related to other parts of your infrastructure.

How to prevent performance bottlenecks in Google Compute Engine: CPU spikes, RAM waste, and network overload

Cloud computing is all about efficiency. You need to get the most out of your resources without overspending or causing performance issues. For example, if you’re running virtual machines in Google Compute Engine, you need to size your instances correctly, optimize your workloads, and monitor your network traffic to prevent unexpected failures. However, when resources aren’t properly managed, things can quickly spiral out of control.

How to use data source variables in Grafana dashboards

Data source variables let you change where Grafana looks for data without having to create duplicate dashboards. So for example, if you have multiple different Prometheus databases, you can have one dashboard and use a data source variable to choose which Prometheus that dashboard uses. We'll look at how to set these up in this video. Grafana Cloud is the easiest way to get started with Grafana dashboards, metrics, logs, and traces. Our forever-free tier includes access to 10k metrics, 50GB logs, 50GB traces and more. We also have plans for every use case.

Meet Ted Young, OpenTelemetry co-founder and the newest Grafanista

In just a few short years, OpenTelemetry has become the second largest CNCF project behind Kubernetes and is well on its way to becoming an industry standard for collecting and exporting telemetry data. And with KubeCon + CloudNativeCon Europe 2025 just around the corner, there’s no one better to talk to about the state of OpenTelemetry than Ted Young. Ted is the co-founder of OpenTelemetry and serves on the OpenTelemetry Governance Committee.

License to observe: Why observability solutions need agents

Note: The original version of this blog post published on ;login: on February 24, 2025. When architecting the flow of observability data such as logs, metrics, traces or profiles, you’ve likely noticed that most solutions ask you to deploy an agent or collector. Understandably, you might be hesitant to deploy yet another application just so you can get your data into your storage system of choice.

Grafana 11.6 release: new data visualization features, LBAC for metrics data sources, alerting updates, and more

Our engineering team is hard at work on Grafana 12, the next major release of the open source data visualization platform that we’re launching at GrafanaCON this May, but in the meantime, Grafana 11.6 is officially here — and there’s a lot to be excited about. The latest minor release delivers a number of new dashboarding features, including one-click data links and actions, along with other notable updates related to security, alerting, and more.

The state of observability in 2025: a deep dive on our third annual Observability Survey

Across companies of all shapes and sizes, observability practices are maturing and getting attention at the highest levels. At the same time, cost and complexity continue to hinder efforts as teams look to emerging tools to help simplify their processes in hopes of better outcomes. With so much in flux, we went into our third annual Observability Survey hoping to get a window into the ways the community is approaching observability and where it wants it to go next.