Introducing o11y-bench: an open benchmark for AI agents running observability workflows

Evaluating agents is hard. Verifying observability tasks is harder. Yes, AI agents have gotten dramatically and quantifiably better at coding and tool use, but observability presents a different kind of challenge. In a real incident, the hard part is rarely just writing a query. It's deciding which signal matters, figuring out whether a spike is noise or symptom, correlating metrics with logs and traces, and sometimes making a change in Grafana without breaking the dashboard another engineer depends on.

Grafana 13 release: get value from your data faster, manage operations at scale, and more!

Who says 13 is unlucky? With the release of Grafana 13, we're giving the community the most streamlined, flexible, and intuitive Grafana experience yet. Unveiled during the opening keynote of GrafanaCON 2026, the latest major release is all about helping you get value from your data faster, whether you’re spinning up dashboards, operating Grafana at scale, or extending the platform as your requirements change. Download Grafana 13.

Git Sync: Observability as code built for scale | Demo | Grafana Labs

In this video, Fabrizia Rossano and Roberto Jiménez demonstrate Git Sync, a feature that provides you with the power of Git version control right in your Grafana instance. Git Sync enables you to submit changes in your dashboards as pull requests and get them reviewed by your team directly from Grafana or from Git.

Grafana 13 TL;DR - What's New (and Worth Your Time)

Grafana 13 is here! In this video, we walk through the biggest updates and improvements, from faster ways to build dashboards to new features that make Grafana easier to manage at scale. We cover things like: If you’ve ever struggled with broken dashboards, messy layouts, or just getting started from scratch, this release focuses on making those workflows a lot smoother. This is a TL;DR, so we’re just scratching the surface—but it should give you a solid sense of what’s new and what’s worth checking out.

Power BI Dashboard Best Practices for Data Engineers and BI Developers

A strong Power BI dashboard is not built solely on visuals. For data engineers and BI developers, the dashboard is the final expression of a much larger analytics system. Its quality depends on the data model's structure, the discipline of the transformation layer, the clarity of the DAX logic, the dataset's performance, and the security model governing access.

Monitor Databricks with Grafana Cloud for instant visibility into your workloads

If you're running Databricks workloads, you've probably asked yourself these types of questions: How much is this costing me? Why did that job fail last night? Why are my dashboard queries suddenly slow? We've been there, too. Databricks is fantastic for data engineering, ML, and analytics. But once you start running jobs, pipelines, and SQL queries at scale, you need a way to keep tabs on what's happening. That's why we built the Databricks integration for Grafana Cloud.

Honeybadger Insights Parameterized Queries

Make your Honeybadger Insights dashboards and queries dynamic with parameterized queries. In this short walkthrough, we'll take a static system dashboard — showing load average, memory, and disk usage across a fleet of hosts — and turn it into an interactive view you can filter to a single host with one click. What you'll see: Parameterized queries are a simple way to build one dashboard that serves many views — no duplication, no extra widgets, just a shareable URL.

Sentry Built AI Dashboards: Monitor Your AI Agents End-to-End

Building AI applications? There's a lot more to monitor beyond errors. With tracing enabled, Sentry's built-in AI Dashboards give you deep visibility into how your agents are actually performing. This video walks through three key dashboard views: You'll also see how to drill from a dashboard widget straight into the trace explorer to pinpoint the root cause of errors, how to duplicate and customize dashboards to fit your needs, and how to set up monitors with alert thresholds - like getting notified if your LLM calls exceed 20 seconds.