Evolution of Site Reliability - Incidentally Reliable with Manoj Sebastian
Catch Manoj Sebastian(ex-Flipkart, Amazon, Atlassian, Intuit, Yahoo) talk about The Evolution of SRE through 20 years, Incident Response and Post Incident Culture at Big Tech and the Future of Reliability with AI ramping up at full speed.
The freshest podcast for Site Reliability Engineers, hosted by Vishwa and Shubham from Zenduty.
Zenduty is a revolutionary incident management platform that gives you greater control and automation over the incident management lifecycle.
Learn more: https://www.zenduty.com/
00:00 - Intro
01:12 - India's first product companies
06:50 - Yahoo's Culture Under David Filo
10:01 - The Inception of Site Reliability
14:00 - When and How to Setup SRE
16:00 - Hiring Patterns for SRE
19:51 - Amazon's Amazing SRE Processes
24:21 - Reliability at SaaS vs e-commerce
26:27 - Heavy Crown of a Reliability Leader
31:34 - Reliability Leaders at the Business Table
36:25 - Marrying Business Metrics and Engineering
40:09 - Observability in the next 10 years
45:42 - Will SREs remain relevant?
49:26 - Developing Empathy for Customers
54:28 - Projecting Reliability to Customers
57:53 - War-room Stories
1:04:31 - Work Life Balance in the SRE World