Operations | Monitoring | ITSM | DevOps | Cloud

Latest Videos

Postmortems and Retrospectives (class SRE implements DevOps)

Even after a service has been restored, SREs still have a bit of work to do. In this video, Liz and Seth discuss the postmortem process that SREs follow. Blameless postmortems and retrospectives are key to learning from failures and preventing recurrence. You will learn about the importance of conducting a postmortem, strategies for conducting a blameless postmortem, and techniques for trending retrospectives across your entire organization to gain better insights to prevent service disruptions in the future.

Disruption Detector and Real Time Monitoring with Stackdriver (Cloud Next '18)

Aja built an interactive disruption detector panel for attendees at the Google I/O Conference to intentionally cause errors to happen to the system. This demo highlights the amazing real time monitoring feature of Stackdriver as it tracks all incoming errors and make things easier for developers to pinpoint the issue. Watch the video to learn more.

Incident Management (class SRE implements DevOps)

In the previous video, Liz and Seth discussed how to make systems observable and how observability helps us diagnose failing systems, but didn't cover what to do when an incident grows beyond the ability of one person to do it all. In this video, you learn about the most important part of the incident management process – humans.

Cloud OnAir: CE TV: Application Observability with LightStep

Observability remains a key challenge as customers embrace DevOps. Join Daniel "Spoons" Spoonhower, the CTO and Founder of Lightstep, a Google Cloud customer, and Yuri Grinshteyn, a Google Cloud Customer Engineer to learn about how Lightstep was built on Google Cloud to enable you to monitor what matters most and diagnose anomalies within seconds across web, mobile, monoliths and microservices.

Operating Serverless Apps with Google Stackdriver (Cloud Next '18)

For a complete serverless solution you need to be able to monitor your serverless apps, learn when they are not functioning correctly, and debug them when necessary. This session teaches you how to use Google Stackdriver to monitor and troubleshoot serverless apps.