SRE

The latest News and Information on Service Reliability Engineering and related technologies.

Sponsored Post

Reducing Security Incidents: Implementing Docker Image Security Scanner

Are you using Docker to deploy your applications? If so, you're not alone: Docker's popularity has skyrocketed in recent years. While it offers numerous benefits, it also introduces new security risks that need to be addressed. But why is reducing security incidents so important? Simply put, the cost of a security breach can be devastating. From lost customer trust to financial losses, the consequences of a security incident can be severe. That's why it's crucial to take steps to prevent them from occurring in the first place. Enter Docker image security scanners.

Webinar on 'Evolution of Incident Management from On-Call to SRE' | Squadcast

Incident Management has evolved considerably over the last decade, more so in the last few years. What was traditionally limited to an in-house on-call team and an alerting system has now grown well beyond that, with Automation, Collaboration, Transparency, and Retrospection deeply entrenched in Incident Response.

Site Reliability Engineer: Responsibilities, Roles and Salaries

DevOps gained popularity in order to combat siloed workflows, decreased collaboration and a lack of visibility across the software development lifecycle. While establishing a culture of DevOps has helped teams collaborate better and deliver reliable software faster, DevOps teams don’t necessarily have someone specifically dedicated to developing systems that increase site reliability and performance. That’s where a site reliability engineer (SRE) comes into the picture.

Strategies for Kubernetes Cluster Administrators: Understanding Pod Scheduling

Kubernetes has revolutionized container orchestration, allowing developers to deploy and manage applications at scale. However, as the complexity of a Kubernetes cluster grows, managing resources such as CPU and memory becomes more challenging. Efficient pod scheduling is critical to ensure optimal resource utilization and enable a stable and responsive environment for applications to run in.

Deduplication Rules | Reduce Alert Noise by Clustering Similar Alerts | Squadcast

Alert Deduplication can help you reduce alert noise by organising and grouping alerts. It also provides easy access to similar alerts when needed. This video on Alert Deduplication will help you define Deduplication Rules for each Service in Squadcast. An alert is deduplicated when these rules evaluate to true for an incoming incident.
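
To make the idea of rule-based deduplication concrete, here is a minimal sketch of how per-service rules might be evaluated against an incoming alert. It is not Squadcast's implementation or API: the `Alert` class, the `same_message` rule and the `ingest` function are hypothetical names chosen for illustration, and the rule shown (an exact message match) is only one assumed example of what a rule could check.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Alert:
    # Hypothetical alert shape; real alert payloads carry many more fields.
    service: str
    message: str
    duplicates: list = field(default_factory=list)


# A rule compares an existing open alert with an incoming one.
Rule = Callable[[Alert, Alert], bool]


def same_message(existing: Alert, incoming: Alert) -> bool:
    """Assumed example rule: treat alerts with identical messages as duplicates."""
    return existing.message == incoming.message


def ingest(open_alerts: List[Alert], incoming: Alert,
           rules: Dict[str, List[Rule]]) -> Alert:
    """Attach `incoming` to an open alert if any rule for its service
    evaluates to true; otherwise open it as a new alert."""
    service_rules = rules.get(incoming.service, [])
    for existing in open_alerts:
        if existing.service != incoming.service:
            continue
        if any(rule(existing, incoming) for rule in service_rules):
            existing.duplicates.append(incoming)
            return existing
    open_alerts.append(incoming)
    return incoming


# Usage: rules are defined per service, as the teaser describes.
open_alerts: List[Alert] = []
rules = {"checkout": [same_message]}
first = ingest(open_alerts, Alert("checkout", "disk full"), rules)
repeat = ingest(open_alerts, Alert("checkout", "disk full"), rules)
other = ingest(open_alerts, Alert("checkout", "high latency"), rules)
```

In this sketch the repeated "disk full" alert is grouped under the first one instead of opening a new entry, while the "high latency" alert stays separate because no rule matches it.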