%term

Curb alert noise for better productivity : How-To's and Best Practices

Nov 3, 2020 By Squadcast In Squadcast

On the quest to provide the best uptime, software platforms depend on complex interconnected microservices. This often leaves them vulnerable to cascading failures creating a massive deluge of alerts from monitoring tools when things go wrong. In this blog, we explore how Squadcast can be configured to curb alert noise for better productivity with the help of the most advanced deduplication features.

Read Post

Squadcast

Read more about Curb alert noise for better productivity : How-To's and Best Practices

Choosing SLOs that users need, not the ones you want to provide

Oct 1, 2020 By Squadcast In Squadcast

In our latest two-part series blog, Adam Hammond, talks about how you can build sustainable SLOs that are appropriate for your users, your technology platform, and your business which in turn will help you make your systems robust, your customers happy, and your business boom.

Read Post

Squadcast

Read more about Choosing SLOs that users need, not the ones you want to provide

Keep track of your on-call responsibilities

Aug 19, 2020 By Squadcast In Squadcast

On-call Engineers are the first line of defense when an outage occurs ensuring customer-impacting services are quickly noticed & resolved. Our latest blog outlines some of the use cases that can help to avoid the pitfalls in organization’s on-call rotation.

Read Post

Squadcast

Read more about Keep track of your on-call responsibilities

Keeping your teams and customers in the loop during downtime

Aug 12, 2020 By Squadcast In Squadcast

Making your organization more transparent is not always an easy process. In our latest blog post, Adam Hammond, shares some tips and tools that can help you get started when it comes to keeping your teams and customers in the loop during downtime.The core message is that you need to make communication a cultural pillar of your organization.

Read Post

Squadcast

Read more about Keeping your teams and customers in the loop during downtime

Nishant Singh shares his thoughts on being an SRE

Aug 5, 2020 By Squadcast In Squadcast

Nishant Singh is an SRE at LinkedIn based in Bangalore. Currently, he is working towards building and maintaining applications that improve the overall MTTD (Mean time to detect) and MTTR (Mean time to recover) of the site. He likes to build services and play with the latest technologies. Before LinkedIn, Nishant worked for a few companies in the security and e-commerce domain as a DevOps engineer where he was primarily responsible for building infrastructure, deployment pipelines and security.

Read Post

Squadcast

Read more about Nishant Singh shares his thoughts on being an SRE

Evan Niedojadlo from Peddle shares his thoughts on being an SRE

Jul 27, 2020 By Squadcast In Squadcast

Evan Niedojadlo is an SRE at Peddle based in Austin, TX. He is currently on a small team and works on the SRE, Ops, and Security area of the organization. In his free time, he enjoys building communities, reading, music, helping others learn, and being outside.

Read Post

Squadcast

Read more about Evan Niedojadlo from Peddle shares his thoughts on being an SRE

Understanding the landscape of AWS compute

Jul 10, 2020 By Squadcast In Squadcast

In the second part of our "SLOs for AWS-based infrastructure" blog , Gigi Sayfan dives deeper into understanding the landscape of AWS compute by using the lens of Kubernetes to compare and contrast & covers in detail setting of SLOs for ECS, EKS, Fargate, and Lambda based services.

Read Post

Squadcast

Read more about Understanding the landscape of AWS compute

SLOs for AWS-based infrastructure

Jul 8, 2020 By Squadcast In Squadcast

In our latest two-part series blog, Gigi Sayfan, author of “Mastering Kubernetes”, discusses managing complex infrastructure on AWS with an eye towards SLOs (service level objectives). Though there are many ways to discuss the management of infrastructure, in this two-part series, he covers SLOs for AWS, Observability on AWS, Quotas Limits, and Optimizing cost on AWS and in the second part, he uses the lens of Kubernetes to compare and contrast compute infrastructure on AWS with Kubernetes.

Read Post

Squadcast

Read more about SLOs for AWS-based infrastructure

Kubernetes Operators for Automated SRE

May 27, 2020 By Squadcast In Squadcast

It can be quite challenging for an SRE team to maintain the well-being of a large-scale Kubernetes based system with hundreds or thousands of services. In this blog post, Gigi Sayfan, author of “Mastering Kubernetes”, outlines the SRE challenge and how we can achieve the ultimate goal of automated SRE with Kubernetes operators.

Read Post

Squadcast

Read more about Kubernetes Operators for Automated SRE

On-call On-boarding Checklist

May 20, 2020 By Squadcast In Squadcast

And it starts with the company culture. Irrespective of how small or large your team is, it’s wise to invest some time in creating a good on-call onboarding plan. A humane on-call is the mark of a good engineering culture. Being on-call means that you’re expected to be reachable for any issues that may occur during your shift. It’s easy to lose any and all motivation by just anxiously anticipating that mid-dinner ping.

Read Post

Squadcast

Read more about On-call On-boarding Checklist

Operations | Monitoring | ITSM | DevOps | Cloud

Curb alert noise for better productivity : How-To's and Best Practices

Choosing SLOs that users need, not the ones you want to provide

Keep track of your on-call responsibilities

Keeping your teams and customers in the loop during downtime

Nishant Singh shares his thoughts on being an SRE

Evan Niedojadlo from Peddle shares his thoughts on being an SRE

Understanding the landscape of AWS compute

SLOs for AWS-based infrastructure

Kubernetes Operators for Automated SRE

On-call On-boarding Checklist

Monthly Archive

Follow Us