OpenTelemetry Monitoring with Netdata

If you've standardized on OpenTelemetry (or you're heading that way), you probably know the collector gets your data out, but where it lands and how useful it is once it gets there are separate problems. Netdata now ingests both OTLP metrics and OTLP logs natively, so your OTel pipelines feed directly into the same monitoring experience as everything else in your infrastructure: same dashboards, same alerting, same query interface. No separate backends, no context switching.

Future Solving with Brian Evergreen (Or: How to Escape those AI Career Jitters)

Brian Evergreen joins the show to challenge the fear-driven narrative around AI and work. Rather than treating the future as something coming for us, Brian argues that leaders and individuals should decide what future they want to create, then work backwards. He explores why “start with the problem” thinking limits AI strategy, how visible strategy and relational leadership can unlock better transformation, and why human connection may become more valuable—not less—in an AI-enabled world. A thoughtful conversation on escaping AI career anxiety, building resilient networks, and creating value beyond efficiency.

WHOIS & RDAP Domain Lookup & Expiry Check

In this video, we’ll walk you through how to set up and configure your WHOIS and RDAP Domain Lookup & Expiry Checks in Uptime.com. Learn how to monitor your domain and receive alerts before it expires, and protect your registration information from unauthorized modifications. We cover step-by-step instructions for setting up checks through the Uptime.com UI and via the API.

Ameet Talwalkar on Building the AI Research Lab

"We're doing cutting-edge AI, focused on real translational impact: getting our research over the wall and into production." Ameet Talwalkar, Datadog's Chief Scientist, shares what it took to build the AI Research Lab from the ground up — and what makes DAIR different from traditional research teams. At Datadog, research ships. Recent work from the lab includes Toto 2.0, open-weights time series forecasting models ranked on leading benchmarks, and ARFBench, a new benchmark for evaluating AI on real incident data.

Instant Java Client SDK, no spec required!

Learn how to generate a client SDK for a production service when you have no documentation, no OpenAPI spec, and no remaining team knowledge of the original Ruby code. This demo shows you how to capture real production data from a running app and transform it into a functional Java client library in minutes. Visit proxymock.io or speedscale.com to learn more.

Search Azure Blob data in-place with BYOS for Cribl Lake

See how Bring Your Own Storage (BYOS) in Cribl Lake allows teams to connect directly to Azure Blob Storage and instantly search data in place — without moving, duplicating, or rehydrating telemetry. In this demo, Cribl Product Manager Risk Salsa walks through setup, dataset creation, and how to run fast investigations across your Azure-hosted data using Cribl Search.

AI Might Break Open Source Differently Than You Think

AI coding agents may not replace open source libraries overnight. But Adam Arellano, Field CTO at Harness, thinks models like Mythos could expose a bigger problem: finding bugs, vulnerabilities, and edge cases faster than maintainers can keep up. That might be the real threat to tools and libraries.

Lessons From a CI/CD Supply Chain Attack at Grafana Labs

When a compromised GitHub Actions workflow targets your CI/CD pipeline, how do you respond — and what do you change so it never happens again? Nick and David from Grafana Security walk through a real supply chain incident triggered by a pull_request_target misconfiguration, showing exactly what broke, what tools caught it, and what the team rebuilt afterward.

Getting Started with gcx: A CLI for AI Agents and Grafana Telemetry | Demo

AI agents are only as useful as the context they can access. With gcx, your coding agents can connect to Grafana and query real-time production telemetry from your Cloud, Enterprise, or OSS environment. The best part: it avoids the upfront context bloat that can come with loading tools before you even send a prompt. gcx uses a CLI approach, so there’s zero token cost until your agent actually needs to run a query.

Best Practices in the Slack Experience

PagerDuty’s Slack experience is evolving to help your teams organize better and resolve incidents faster. Use Triage Channels to collect telemetry and updates from your systems. Create dedicated Incident Channels for coordination and resolution. Give stakeholders the updates they need in Announcements Channels. Everyone in your organization can easily get the information they need.