%term

Latest research from Meta AI, MedRAX, and Rootly AI

Jul 22, 2025 By Rootly In Rootly

View Video

Rootly

Read more about Latest research from Meta AI, MedRAX, and Rootly AI

Balancing Reliability at the Crypto-Finance Frontier with Brian Shaw (Uphold)

Jul 3, 2025 By Rootly In Rootly

Sylvain Kalache sits down with Brian Shaw, Senior Engineering Leader at Uphold, to explore the reliability challenges that arise when operating at the intersection of traditional finance and crypto markets. Brian shares how unexpected market events can create massive traffic spikes, how their platform architecture and Kubernetes setup help them stay resilient, and why Uphold's transparency and regulatory approach make them both trustworthy and a high-profile target.

View Video

Rootly

Read more about Balancing Reliability at the Crypto-Finance Frontier with Brian Shaw (Uphold)

Benchmarking Llama 4 with GitHub Multiple Choice Benchmarks

May 9, 2025 By Rootly In Rootly

How accurately can LLMs predict how bugs were fixed? To start exploring this field, we put Llama 4 and other leading models to the test using a GitHub Multiple Choice Benchmark. Each model was given a real bug ticket and had to identify the pull request that resolved it.

View Video

Rootly

Read more about Benchmarking Llama 4 with GitHub Multiple Choice Benchmarks

Why Reliability Starts with the Network, even in the AI era, with Marino Wijay

Apr 17, 2025 By Rootly In Rootly

In this episode, we explore how networking has shaped reliability as we know it. Marino Wijay cloud networking expert and Staff Solutions Architect at Kong shares how his journey began not as an SRE, but with cables, routers, and switches. Marino explains the evolution of the fabric holding systems together through virtualization, and how software-defined networking, which is now a key element to resilient applications.

View Video

Rootly

Read more about Why Reliability Starts with the Network, even in the AI era, with Marino Wijay

Creating an LLM-powered Incident Diagram

Apr 17, 2025 By Rootly In Rootly

Jeba Emmanuel, Rootly AI Labs Fellow, explains how he created a tool that takes a GitHub repository and a postmortem repository to generate an incident diagram and a timeline. The solution uses a series of highly-specialized LLMs for better and more consistent results.

View Video

Rootly

Read more about Creating an LLM-powered Incident Diagram

The New Rootly Ringtones: How Research-based On-Call Sounds

Apr 17, 2025 By Rootly In Rootly

We set out to create a ringtone that wasn’t just loud—but the sound of a modern pager. Something that wakes you up, but without triggering a full-blown adrenaline spike. In this video, go behind the scenes with sound engineer Gorjão as he crafts a how research-based on-call sound sounds like.

View Video

Rootly

Read more about The New Rootly Ringtones: How Research-based On-Call Sounds

Metrics That Matter: Measuring Developer Productivity in the AI Era

Apr 9, 2025 By Rootly In Rootly

In this episode, Ryan McDonald is joined by Mark Quigley, Head of Platform Engineering at Ninety.io, for a conversation that cuts through the noise around developer productivity metrics and AI. Mark dives deep into how teams can measure what matters—without falling into the trap of turning every measure into a target. He shares how tools like Developer NPS, DORA metrics, and balanced scorecards can help teams optimize for both output and well-being—but only when framed with the right intent.

View Video

Rootly

Read more about Metrics That Matter: Measuring Developer Productivity in the AI Era

How Motive achieves 99.99% reliability with Rootly

Mar 24, 2025 By Rootly In Rootly

In the high-stakes world of fleet management, reliability isn’t a nice-to-have—it’s a necessity. That’s why Motive has invested heavily in tools and processes to ensure its systems run smoothly for over 150,000 customers and more than a million vehicles. At the center of its ability to deliver 99.99% uptime at scale is Rootly.

View Video

Rootly

Read more about How Motive achieves 99.99% reliability with Rootly

Are AI and Platforms Making SRE Obsolete? With Kaspar von Grünberg, Humanitec's CEO

Mar 24, 2025 By Rootly In Rootly

Last year, over 89% of companies claimed to have adopted platform engineering. And, in the past month, LLMs have been disrupting how we think about software development. In this context, Kaspar, asks if the role of Site Reliability Engineers is being obsolete as we know it. Kaspar argues that while SREs aren’t going anywhere, their responsibilities are evolving—fast. We talk about.

View Video

Rootly

Read more about Are AI and Platforms Making SRE Obsolete? With Kaspar von Grünberg, Humanitec's CEO

Scientific Incident Management with Dan Slimmon

Mar 13, 2025 By Rootly In Rootly

Dan Slimmon is an incident management veteran who's worked at Etsy, HashiCorp, and now leads consulting and training on pragmatic, non-bureaucratic incident response. In this episode, Dan shares his philosophy on "scientific incident response," the importance of hypothesis-driven troubleshooting, and why incidents should be seen as normal in complex systems.

View Video

Rootly

Read more about Scientific Incident Management with Dan Slimmon

Operations | Monitoring | ITSM | DevOps | Cloud

Latest research from Meta AI, MedRAX, and Rootly AI

Balancing Reliability at the Crypto-Finance Frontier with Brian Shaw (Uphold)

Benchmarking Llama 4 with GitHub Multiple Choice Benchmarks

Why Reliability Starts with the Network, even in the AI era, with Marino Wijay

Creating an LLM-powered Incident Diagram

The New Rootly Ringtones: How Research-based On-Call Sounds

Metrics That Matter: Measuring Developer Productivity in the AI Era

How Motive achieves 99.99% reliability with Rootly

Are AI and Platforms Making SRE Obsolete? With Kaspar von Grünberg, Humanitec's CEO

Scientific Incident Management with Dan Slimmon

Monthly Archive

Follow Us