%term

The latest News and Information on Service Reliability Engineering and related technologies.

Migrating From Your Tool to Squadcast

Jun 17, 2024 By Chitra Bisht In Squadcast

In our recent blog we talked about how having separate tools for On-Call and for alerting sucks! And how Squadcast offers a lifeline with its all-in-one Incident Management and Reliability Automation platform by amalgamating multiple tool functionality under a single hood. This blog is all about how you can easily transition from your current Incident Management & alerting tool into a better and more reliable enterprise grade platform with Squadcast.

Read Post

Squadcast

Read more about Migrating From Your Tool to Squadcast

Complete Incident Management Playbook for Enterprises

Jun 14, 2024 By Vishal Padghan In Squadcast

Effective Incident Management is indispensable for maintaining the stability and reliability of enterprise operations. Modern businesses heavily depend on their IT infrastructure, making the swift and efficient management of incidents that disrupt normal operations a top priority. A robust Incident Management process can significantly reduce downtime, boost productivity, and uphold customer satisfaction.

Read Post

Squadcast

Read more about Complete Incident Management Playbook for Enterprises

Free to be SRE, with this systems engineering syllabus

Jun 14, 2024 By Max Saltonstall In Google Operations

Learn more about systems engineering and how to get started with these key resources curated by Google’s Site Reliability Engineering (SRE) team.

Read Post

Google Operations

Read more about Free to be SRE, with this systems engineering syllabus

What needs to change in software monitoring?

Jun 13, 2024 By Aniket Rao In Last9

A wishlist of things that need to change in the world of software monitoring.

Read Post

Last9

Read more about What needs to change in software monitoring?

How Agile Leadership Transforms IT Operations

Jun 11, 2024 By Chitra Bisht In Squadcast

Traditional IT operations, with their waterfall processes and lengthy release cycles, can feel sluggish in today's business environment. This constant state of "catch-up" can lead to frustration for developers, ops staff, and business leaders alike. Developers struggle to see their innovative ideas come to life quickly. Operations teams scramble to deploy code that feels outdated before it even hits production. Business leaders see their growth potential hampered by slow IT delivery.

Read Post

Squadcast

Read more about How Agile Leadership Transforms IT Operations

Dynamic Annotations in Last9: Unlocking routing links, detailed alert descriptions, and more

Jun 11, 2024 By Last9 In Last9

Using template variables to insert dynamic values based on labels for use cases like detailed alert descriptions, routing links, etc.

View Video

Last9

Read more about Dynamic Annotations in Last9: Unlocking routing links, detailed alert descriptions, and more

How we reduced monitoring costs and deprecated Thanos for Replit

Jun 7, 2024 By Prathamesh Sonpatki In Last9

Winning Replit over by taming High Cardinality data and deprecating Thanos.

Read Post

Last9

Read more about How we reduced monitoring costs and deprecated Thanos for Replit

2024 SRE Report: AI is not replacing human intelligence anytime soon

Jun 5, 2024 By Leo Vasiliou In Catchpoint

Automation cast a shadow over the future of work for many years. Generative AI (GenAI) is now the latest innovation stealing all the headlines, fueling countless debates and fears about machines taking over human jobs. However, our 2024 SRE Report offers a perspective that challenges this notion.

Read Post

Catchpoint

Read more about 2024 SRE Report: AI is not replacing human intelligence anytime soon

Assessing DevOps Performance - DORA Metrics

Jun 4, 2024 By Chitra Bisht In Squadcast

Feeling the pressure to constantly deliver new features? The struggle is real. But what if there was a way to measure your DevOps performance and transform your team into a release machine? This blog is all about DORA metrics, a data-driven framework to unlock DevOps agility. We'll explore what these metrics tell you, how to implement them, and ultimately, how to use them to turn your team into a release champion.

Read Post

Squadcast

Read more about Assessing DevOps Performance - DORA Metrics

How To Reduce The Alert Noise For Optimal On-Call Performance

May 31, 2024 By Chitra Bisht In Squadcast

The relentless push in organizations can have unintended consequences, particularly for your On-Call engineers. One threat that can quickly erode their effectiveness is alert noise. When your On-Call engineers are bombarded by constant alerts (– genuine emergencies, false positives or redundant notifications) it creates a state of information overload, forcing them to constantly switch context and struggle to identify the critical issues amidst the din. The result?

Read Post