You Can't Fix What You Don't Measure: Observability in the Age of AI with Conor Bronsdon

Nov 5, 2025 By Rootly In Rootly

Only 50% of companies monitor their ML systems. Building observability for AI is not simple: it goes beyond 200 OK pings. In this episode, Sylvain Kalache sits down with Conor Bronsdon (Galileo) to unpack why observability, monitoring, and human feedback are the missing links to making large language models (LLMs) reliable in production.

Same code, same infra but your model is now broken #ai #devops

Oct 30, 2025 By Rootly In Rootly

Hiring SREs in the AI era w/ Weights & Biases

Oct 14, 2025 By Rootly In Rootly

The End of "Good Code"? AI, Throughput, and Reliability with CircleCI CTO Rob Zuber

Sep 10, 2025 By Rootly In Rootly

Is “good code” still the right measure of engineering success in an AI-driven world? In this episode of *Humans of Reliability*, Rob Zuber, CircleCI CTO, joins Sylvain to explore how coding assistants are reshaping developer workflows and changing what teams value. Rob shares what he’s seeing across CircleCI’s customer base: a clear boost in throughput, new bottlenecks shifting from code creation to code review, and the rise of “vibe coding,” where engineers trust AI-generated code they may not fully understand.

The Art of Incident Management #sre

Sep 9, 2025 By Rootly In Rootly

Read our post: https://rootly.com/blog/the-art-of-incident-management-part-i

Connectivity Layer in Agentic AI w/ Alloy Automation #ai

Sep 8, 2025 By Rootly In Rootly

What companies get wrong about LLM evals w/ Groq

Sep 4, 2025 By Rootly In Rootly

Frontline Reliability: Protecting User Journeys with SLOs with Shery Brauner (Razor, ex-Zalando)

Aug 20, 2025 By Rootly In Rootly

What does it really take to move from firefighting incidents to building reliability at scale? In this episode of Humans of Reliability, Shery Brauner (Razor, ex-Zalando) shares her unique journey from frontend and backend engineering to leading site reliability practices. She explains why protecting the user journey is the key to effective incident management, how SLOs cut through noisy alerts, and why observability must come first.
