
Latest News

SRE Report 2023: Are we Aligned? Yes. No. Maybe.

In each year's SRE Report, there's a trend or anti-pattern that leaps out and makes us pause and reflect. Last year, for example, we found a huge drop in global toil levels. With the whole world working from home for a full year, it made sense that global toil levels would drop, right? But this year, despite the great reopening underway, toil levels dropped even further. It's a paradox, one that will no doubt require its own scrutiny.

Lessons from the CircleCI Security Incident

In some respects, security and reliability are competing priorities. Security controls may reduce reliability, and responding to security incidents may require mission-critical systems to be paused or shut down until they're secure. The recent security incident involving CircleCI, however, shows that it's not always necessary to choose between security and reliability.

Failure Analysis: Engineering incidents are a bigger problem than you think

Engineering incidents can be quite harmful for companies, both in terms of financial costs and reputational damage. In some cases, engineering incidents can even put people's lives at risk, which can have serious legal and moral implications for the company involved.

SLA Vs SLO: Tutorial & Examples

Service level agreements (SLA) and service level objectives (SLO) are increasing in popularity because modern applications rely on a complex web of sub-services such as public cloud services and third-party APIs to operate, making service quality measurement an operational necessity for serving a demanding market. This article focuses on the similarities and differences between SLAs and SLOs, explains the intricacies involved in implementing them, presents a case study, and finally recommends industry best practices for implementing them.

Looking back at our journey through 2022

We are on the cusp of breaking into 2023 🗓️ with a bag full of interesting memories. Before we wrap up the year-end celebrations, we'd like to look back and highlight some notable events that took place at Squadcast. Squadcast has grown by leaps and bounds over the past 12 months on our journey toward becoming an integrated Reliability Workflow platform. 😎

Squadcast + Hund Integration: A Simplified Approach for effective Alert Routing

Hund is a versatile Service Monitoring & Communication tool. It helps monitor services and keeps your audience informed about any status changes automatically through a status page. If you use Hund for monitoring and management requirements, you can integrate it with Squadcast, an end-to-end incident response tool, to route detailed alerts from Hund to the right users in Squadcast.

15 DevOps and SRE Tools you Should Know About in 2023

With the constantly evolving landscape of technology, professionals in the DevOps and SRE fields need to stay up-to-date and knowledgeable about the tools and practices driving the industry forward. Whether you are just starting your career or have been working in DevOps or SRE for years, this post will provide valuable insights and information on the tools you should be familiar with as we head into 2023.

Getting Amazon GuardDuty alerts via SNS Endpoint

Monitoring your infrastructure and safeguarding it against threats is not easy. Setting up the infrastructure and then monitoring, collecting, and analyzing information for threat detection is a cumbersome process. This is where a security monitoring service like Amazon GuardDuty can help. In this blog, we will explore the Amazon GuardDuty service and discuss how integrating it with Squadcast can help you route alerts to the right users for quick and efficient incident response.