%term

The latest News and Information on Service Reliability Engineering and related technologies.

What is OpenTelemetry Collector

Jul 17, 2023 By Last9 In Last9

What is OpenTelemetry Collector, Architecture, Deployment and Getting started.

Read Post

Last9

Read more about What is OpenTelemetry Collector

How JCB is leveraging SRE to lead a successful digital transformation

Jul 15, 2023 By Shimpei Sasano In Google Operations

How JCB improves team structure, risk management, and application and platform development.

Read Post

Google Operations

Read more about How JCB is leveraging SRE to lead a successful digital transformation

InfluxDB vs. Thanos

Jul 14, 2023 By Prathamesh Sonpatki In Last9

InfluxDB vs Thanos: Overview, Pros and Cons, and Differences.

Read Post

Last9

Read more about InfluxDB vs. Thanos

What Is Site Reliability Engineering? Understanding the complexities of this crucial function

Jul 14, 2023 By incident.io In Incident.io

Site reliability engineers manage a lot, and often in incredibly high-stakes environments. Remember that scene from "The Matrix" where Neo dodges bullets in slow motion? Of course you do. As an SRE, it can feel like you're the person getting hit by those bullets, frantically trying to investigate performance issues, automate away toil, and support the engineers around you, all before the next wave of attacks.

Read Post

Incident.io

Read more about What Is Site Reliability Engineering? Understanding the complexities of this crucial function

Improve Visibility and Capture More Data with Triage Incidents

Jul 12, 2023 By Ashley Sawatsky In Rootly

As new incidents emerge, there are often many unknowns about the size, severity, and cause of the problem. Sometimes it’s not clear if the problem is an incident at all. That’s where introducing a triage stage to your incident management process can help. In this post, we’ll look at the benefits of adding a triage layer to your incident management, and how Rootly’s Triage feature allows you to seamlessly transition from triage to real incident (or false alarm).

Read Post

Rootly

Read more about Improve Visibility and Capture More Data with Triage Incidents

What Site Reliability Engineering needs - A swarm of rogue bees

Jul 11, 2023 By Aniket Rao In Last9

If all companies are software companies, all companies need better Observability to understand how performative their software is.

Read Post

Last9

Read more about What Site Reliability Engineering needs - A swarm of rogue bees

Prometheus vs. VictoriaMetrics (VM)

Jul 10, 2023 By Last9 In Last9

Comparing Prometheus vs. VictoriaMetrics (VM) - Scalability, Performance, Integrations.

Read Post

Last9

Read more about Prometheus vs. VictoriaMetrics (VM)

Prometheus vs. Cortex

Jul 7, 2023 By Last9 In Last9

Comparing Prometheus vs. Cortex - Scalability, Cost, Performance, Known Weaknesses.

Read Post

Last9

Read more about Prometheus vs. Cortex

Docker Compose Logs: Guide & Best Practices

Jul 2, 2023 By Squadcast Community In Squadcast

Docker Compose is a tool for defining and running multi-container Docker applications. It allows developers to streamline the process of configuring, building, and running multiple containers as a single unit with a docker-compose.yml. This configuration file specifies the services, networks, and volumes required for an application, and their relationships and dependencies. The docker-compose logs command displays the logs of all services defined in the docker-compose.yml file.

Read Post