Practical Guide to SRE: Infrastructure-as-Code (IaC)
An overview of how SREs can benefit from Infrastructure-as-Code.
An overview of how SREs can benefit from Infrastructure-as-Code.
Although every company can benefit from SREs, some need SREs more than others.
Six tips on how Site Reliability Engineers (SREs) can prepare for the reliability challenges of Black Friday and Cyber Monday 2021
A history of Site Reliability Engineering from its origins at Google in 2003 to the present.
Follow these steps to write a great SRE job resume.
An explanation of the meaning of SLA, SLO and SLI, and how SREs should use each concept to manage reliability.
SREs and SWEs complement each other, but they perform different tasks and focus on different priorities.
Learn about the key roles within an incident response team, as well as optional incident roles you may not have thought about.
A comparison of EKS, AKS, GKE, Rancher and OpenShift from an SRE’s perspective.
Facebook’s October 2021 outage was the type of event that gives SREs nightmares: A series of critical business apps crashed in minutes and remained unavailable for hours, disrupting more than 3.5 billion users around the world and costing about 60 million dollars. As incidents go, this was a pretty big one.