Latest News

ServiceNow + Squadcast Integration: Automate IT Ticketing and Project Tracking

Mar 4, 2022 By Nir Sharma In Squadcast

ServiceNow is a workflow automation platform used by organizations for their IT ticketing and project management needs. In contrast, Squadcast is an end-to-end incident management and SRE platform that is used by organizations for their reliability requirements.

Read Post

Squadcast

Read more about ServiceNow + Squadcast Integration: Automate IT Ticketing and Project Tracking

What SREs Can Learn from Capt. Sully: When to Follow Playbooks

Mar 4, 2022 By Andre King In Rootly

When are you smarter than your playbooks, and when are your playbooks smarter than you? That’s a question that engineers rarely step back to consider. The rational, disciplined parts of our minds tell us that the playbooks we are supposed to follow were carefully designed and tested, and that we should stick to them at all costs.

Read Post

Rootly

Read more about What SREs Can Learn from Capt. Sully: When to Follow Playbooks

Incident Response Lifecycle | A Complete Explanation

Mar 3, 2022 By Emily Arnott In Blameless

Wondering about the incident response lifecycle? We explain what it is, and how each phase helps lead to effective incident resolution. What is the incident response lifecycle? The incident response lifecycle is an organization’s framework for responding to an incident that disrupts service. The incident response lifecycle contains the following phases.

Read Post

Blameless

Read more about Incident Response Lifecycle | A Complete Explanation

Monthly Moo March 2022

Mar 3, 2022 By John Haley In Moogsoft

What a start to 2022 has been for us all. We are incredibly proud of the continuous innovation, velocity and delivery of new features and functionality. We’ve heard success story after success story from our brilliant customers, each unique in their own way and continue to collaborate with them on our roadmap. So, this March update is for you and a massive thank you. We couldn’t do it without you, and it’s been our honor to be part of your success.

Read Post

Moogsoft

Read more about Monthly Moo March 2022

Amplify Artifactory and Distribution Changes Through PagerDuty

Mar 2, 2022 By Deep Datta In JFrog

When automated software delivery runs smoothly, it can whisper, and quietly attend to itself. But when your delivery and distribution pipeline runs into a problem, it must shout. Boosting the volume of Artifactory and Distribution change events and issues through PagerDuty can help ensure they’re heard by everyone whose job it is to monitor your software delivery pipeline.

Read Post

JFrog

Read more about Amplify Artifactory and Distribution Changes Through PagerDuty

Kubernetes Health Check Using Probes

Mar 2, 2022 By Squadcast Community In Squadcast

Kubernetes is an open source container orchestration platform that significantly simplifies an application's creation and management. Distributed systems like Kubernetes can be hard to manage, as they involve many moving parts and all of them must work for the system to function. Even if a small part breaks, it needs to be detected, routed and fixed. These actions also need to be automated. Kubernetes allows us to do that with the help of readiness and liveness probes.

Read Post

Squadcast

Read more about Kubernetes Health Check Using Probes

Mastering Digital Operations Across the Enterprise

Mar 2, 2022 By Sean Scott In PagerDuty

I’m excited to announce that today, PagerDuty is taking our automation capabilities to new scale and scope as we enter into a definitive agreement to acquire Catalytic. With their technology and talented team we accelerate the delivery of enterprise-wide process automation that manages no-code workflows across the business, broadly applicable to any workflow, for any employee.

Read Post

PagerDuty

Read more about Mastering Digital Operations Across the Enterprise

Postmortems Now Called Retrospectives in Blameless

Mar 2, 2022 By Blameless In Blameless

Something big happened at Blameless this month — our “Postmortem” feature was updated to its new name, “Retrospective”. To the naysayer, I suppose you’re thinking, This seems trivial. Different teams call it different names anyway, so why bother making the change? First let me say, thank you for reading our blog and I hope you finish this one through to the end. Now, allow me to explain our reasoning and why we’re excited about this update.

Read Post

Blameless

Read more about Postmortems Now Called Retrospectives in Blameless

Customizing Error Pages (Nginx Ingress Controller)

Mar 2, 2022 By Deepak Kumar In Zenduty

The most common way to do it, which is part of the offical solution is to create a Docker image server capable of responding to any request with 404 content, except /healthz and /metrics. This could be an Nginx instance. /healthz should return 200 /metrics is optional, but it should return data that is readable by Prometheus in case you are using it for k8s metrics. Note: Nginx can provide some basic data that Prometheus can read. /returns a 404 with your custom HTML content.

Read Post

Zenduty

Read more about Customizing Error Pages (Nginx Ingress Controller)

Alert Fatigue in SRE: What It Is & How To Avoid It

Mar 1, 2022 By Emily Arnott In Blameless

Wondering about alert fatigue? We describe what it is, how it affects software development teams, and how to avoid it. What is alert fatigue? Alert fatigue is the phenomenon of employees becoming desensitized to alert messages because of the overwhelming volume they receive, and the number of false positives they receive. The risk with alert fatigue is that important information will be overlooked or ignored.

Read Post

Blameless

Read more about Alert Fatigue in SRE: What It Is & How To Avoid It

Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

ServiceNow + Squadcast Integration: Automate IT Ticketing and Project Tracking

What SREs Can Learn from Capt. Sully: When to Follow Playbooks

Incident Response Lifecycle | A Complete Explanation

Monthly Moo March 2022

Amplify Artifactory and Distribution Changes Through PagerDuty

Kubernetes Health Check Using Probes

Mastering Digital Operations Across the Enterprise

Postmortems Now Called Retrospectives in Blameless

Customizing Error Pages (Nginx Ingress Controller)

Alert Fatigue in SRE: What It Is & How To Avoid It

Monthly Archive

Follow Us