SRE

The latest News and Information on Service Reliability Engineering and related technologies.

OpenTelemetry vs. Prometheus

Jul 26, 2023 By Last9 In Last9

OpenTelemetry vs. Prometheus - Difference in architecture, and metrics.

Read Post

Last9

Read more about OpenTelemetry vs. Prometheus

Breaking Down the Pillars of Observability from Data to Outcomes

Jul 25, 2023 By Last9 In Last9

The world of cloud-native and distributed microservices has revolutionized software development and deployment. However, the sheer volume of data these systems generate can often lead to confusion and uncertainty. You're not alone if you've ever felt lost in the sea of observability data.

View Video

Last9

Read more about Breaking Down the Pillars of Observability from Data to Outcomes

Webinar: Embracing Declarative Provisioning and Observability in cloud environments

Jul 24, 2023 By Last9 In Last9

Organizations face increasingly complex challenges in deploying and managing their systems in today's rapidly evolving technological landscape. Declarative provisioning and observability have emerged as a powerful approach to address these challenges. This talk delves into declarative provisioning and observability, exploring its benefits, principles, and practical implementation strategies.

View Video

Last9

Read more about Webinar: Embracing Declarative Provisioning and Observability in cloud environments

Introduction to ELK Tech Stack

Jul 21, 2023 By Chitra Bisht In Squadcast

ELK Stack, also known as the Elastic Stack is a powerful and versatile open-source toolset that has revolutionized the way businesses manage and analyze their data. ELK Stack seamlessly integrates these three robust components to offer a comprehensive solution for searching, analyzing, and visualizing large volumes of data in real-time. So, buckle up, for a comprehensive overview of the ELK stack and its components, which will be a great starting point for beginners.

Read Post

Squadcast

Read more about Introduction to ELK Tech Stack

Pinpoint performance issues in downstream services with the Dependency Map Navigator

Jul 21, 2023 By Scott Richardson In Datadog

Visibility into the upstream and downstream dependencies of your services is key to maintaining a performant microservices environment. Application developers and SREs rely on this visibility to quickly trace issues back to the source, which is essential during incidents—when time is of the essence—throughout day-to-day operations, and as systems evolve and scale.

Read Post

Datadog

Read more about Pinpoint performance issues in downstream services with the Dependency Map Navigator

Blameless Unveils Multibot Support, Empowering Enterprise Security Teams to Manage Incidents on their Terms

Jul 20, 2023 By Blameless In Blameless

Leading Incident Management Solution's New Multibot Feature Allows SecOps Teams to Achieve Greater Flexibility and Convenience.

Read Post

Blameless

Read more about Blameless Unveils Multibot Support, Empowering Enterprise Security Teams to Manage Incidents on their Terms

Enhanced Incident Response: Maximizing Microsoft Teams with Squadcast

Jul 20, 2023 By Abhishek Sony In Squadcast

Off late more and more businesses are relying on ChatOps tools like Microsoft Teams for a range of functions beyond simple communication. Incident management is no exception to this growing trend. However, Microsoft Teams alone may not possess all the necessary capabilities to efficiently perform these functions. To bridge this gap, integration with core applications becomes necessary.

Read Post

Squadcast

Read more about Enhanced Incident Response: Maximizing Microsoft Teams with Squadcast

Mastering Zero Trust - Pillars for Security

Jul 20, 2023 By Emily Arnott In Blameless

Zero Trust is a heightened security measure that blocks people and devices from accessing company data by default, only allowing access to those who prove they require it. Zero Trust assumes restricted access to company resources by all: Anyone or anything accessing company resources requires verification each time the system is accessed. There are no options to “trust this device next time” or “save password for next time”.

Read Post

Blameless

Read more about Mastering Zero Trust - Pillars for Security

Templates for Automating Incident Response

Jul 20, 2023 By Emily Arnott In Blameless

A security incident is the last thing any DevOps lead wants to see. Along with the vast number of protocols required to overcome an incident, there’s a hefty amount of paperwork to complete. Security incidents can even lead to legal repercussions, if personal data is leaked. Incident response templates offer insight into: An incident response plan template drastically reduces the time and effort spent dealing with incident reports.

Read Post

Blameless

Read more about Templates for Automating Incident Response

Unveiling Multibot, the "glue" for enterprise workflows

Jul 19, 2023 By Alex Greer In Blameless

How are you delivering Slack incident management workflows that serve the many teams across your enterprise? How are you addressing the differences in their use cases, access needs, isolation needs, and tech stacks, all while enabling everyone to collaborate? These are challenging questions to answer. To effectively do so, you have a host of conditions to support at the team and company-wide levels: ‍ Team ‍ Company-wide ‍

Read Post