The latest News and Information on Service Reliability Engineering and related technologies.
Navigating On-Call rotations can often feel like taming a storm of alerts and constant disruptions, leaving teams overwhelmed and stressed. Hence there is a need to streamline On-Call rotations and leverage concerned software to restore order and peace. In this guide, you'll explore practical tips, best practices, and smart strategies to transform your Incident Management process. Let's get to a more efficient On-Call experience.
Define PromQL Macros to standardize complex PromQL queries in Levitate.
A detailed comparison of Levitate and Google Managed Prometheus - Cost, Scale and Ease of Use.
Observability is being built by engineers for engineers. In reality, o11y is for all.
Let’s be honest. When you see an alert pop up on your phone, you aren’t thinking “according to section 12 of our most recent SRE handbook used at training 6 months ago I need to keep in mind who should be Incident Commander and who should be Ops Lead”. You’re an engineer at heart.