The latest News and Information on Service Reliability Engineering and related technologies.

Healthchecks + Squadcast Integration: Routing Alerts Made Easy

Aug 26, 2022 By Vishal Padghan In Squadcast

Healthchecks is a cron job monitoring service that listens for HTTP requests and email messages ("pings") from your cron jobs and scheduled tasks ("checks"). You update your job to send an HTTP request to the ping URL every time it runs. When your job does not ping Healthchecks.io on time, you receive an alert. If you use Healthchecks for your monitoring needs, you can now integrate it with Squadcast to route detailed alerts from Healthchecks to the right users in Squadcast.
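The ping pattern described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the integration's actual code: the `run_and_ping` helper, the `send` parameter, and the placeholder ping URL are all hypothetical names introduced here. The job pings only when it finishes cleanly, so a crashed or hung job produces a missed ping, which is what triggers the alert.

```python
import urllib.request
from typing import Callable, Optional

# Placeholder ping URL (assumption): the real one is issued per check.
PING_URL = "https://hc-ping.com/your-check-uuid"


def _http_get(url: str) -> None:
    """Send a plain HTTP GET and discard the response body."""
    with urllib.request.urlopen(url, timeout=10):
        pass


def run_and_ping(
    job: Callable[[], None],
    ping_url: str = PING_URL,
    send: Optional[Callable[[str], None]] = None,
) -> bool:
    """Run `job` and ping `ping_url` only if the job did not raise.

    Returns True when a ping was sent, False otherwise.
    `send` is a hypothetical hook so the HTTP call can be swapped out.
    """
    if send is None:
        send = _http_get
    try:
        job()
    except Exception:
        # No ping on failure: the missed ping is what raises the alert.
        return False
    try:
        send(ping_url)
    except OSError:
        # A network error while pinging should not crash the job wrapper.
        return False
    return True
```

A cron entry would then call a script that wraps the real work, for example `run_and_ping(nightly_backup)`, where `nightly_backup` stands in for whatever the scheduled task does.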

Read Post

Introduction to Service Catalog | Service Ownership | Service Classification Squadcast

Aug 26, 2022 By Squadcast In Squadcast

To make service management a breeze, we bring to you our improved Service Catalog. The Service Catalog is designed to improve Service Classification and bring more transparency to Service Ownership within your org. This video explains how a consolidated summary of all active services from a single dashboard can help you better track your service health.

View Video

What are Runbooks? And why are they needed?

Aug 25, 2022 By Vardhan NS In Squadcast

Imagine being an Ops engineer on a team just struck by tragedy. Alarms start ringing, and incident response is in full force. It may sound like the situation is under control. WRONG! There's panic everywhere. The on-call team is scrambling for the heavenly door to redemption. And the one thing that doesn't stop: stakeholder inquiries. This situation is bad, but it could be worse. Now imagine being a less-experienced Ops engineer on a relatively small on-call team struck by tragedy. If you don't have sufficient guidance, let alone moral support, you're toast.

Read Post

Performing Postmortems & Postmortem Templates at Squadcast | SRE Best practices | Squadcast

Aug 25, 2022 By Squadcast In Squadcast

Postmortems are a way to summarize the resolution of an incident once it is resolved. They are also a way to create a knowledge base of failures and fixes that can be shared across your team, helping build a culture of shared learning and learning from failures.

View Video

Using StatusPage at Squadcast | SRE Best practices | Squadcast

Aug 25, 2022 By Squadcast In Squadcast

Let your customers know how your Services are doing without them having to ask. One of the core principles of SRE is transparency, and Status Pages help you communicate the status of your Services to your customers at all times, rather than you learning the status of your Services from support tickets logged by your customers.

View Video

What are Canary Deployments and Why are they Important?

Aug 25, 2022 By Vishal Padghan In Squadcast

Every modification to software comes with the potential for production problems. Application failures often have serious consequences which can result in a loss of revenue and a poor customer experience. Additionally, organizations constantly try to improve their services for a better customer experience. How can you minimize the chance of error and update your application with confidence?

Read Post

Site Reliability Engineering, Site Reliability Engineers and SRE Practices: State of Adoption

Aug 24, 2022 By Heidi Gilmore In StackState

Site reliability engineering (SRE) is what you get when you treat operations as if it’s a software problem. The mission of an SRE practice is to protect, provide for and progress the software and systems offered and managed by an organization with an ever-watchful eye on their availability, latency, performance and capacity.

Read Post

Site Reliability Engineering: Definition, Principles & How It Differs From DevOps

Aug 22, 2022 By MoovingON In MoovingON

Site crashes and outages can cost hundreds of thousands in lost revenue and inconvenience users. Site Reliability Engineering helps build highly reliable and scalable systems, particularly important for companies that depend on their software to support their customers performing critical operations. Hiring a Site Reliability Engineer is the best way to ensure a software system stays up and running at all times. Not only will they help manage infrastructure and applications, but they'll also be able to advise on how to scale a business as it grows - keeping downtime and incidents at a minimum!

Read Post

Uptime + Squadcast Integration: Routing Alerts Made Easy

Aug 18, 2022 By Vishal Padghan In Squadcast

Uptime is a site monitoring solution that checks various endpoints and notifies users via push notifications when downtime is detected. It collects and stores downtime and response time data, which is then made available to users as reports. If you use Uptime for your monitoring needs, you can now integrate it with Squadcast to route detailed alerts from Uptime to the right users in Squadcast. The steps below will help you set up the Uptime and Squadcast integration.

Read Post

geeks+gurus: Rise of SRE - Survey Insights

Aug 18, 2022 By Sumo Logic In Sumo Logic

Site Reliability Engineering (SRE) continues to rise in adoption. Teams that leverage SRE “good” practices are benefitting, individuals are excited about their jobs and IT and the business are collaborating more efficiently. Sounds interesting? We hope so, as there are a few key insights which you should know. Join us to learn more about the exciting journey of SRE. We have partnered with DevOps Institute (DOI) to conduct their inaugural 2022 Global SRE Pulse Survey, and we are excited to share the pulse on SRE.

View Video