Operations | Monitoring | ITSM | DevOps | Cloud

Tracing

The latest News and Information on Distributed Tracing and related technologies.

Avoid Stubbing Your Toe on Telemetry Changes

When you have questions about your software, telemetry data is there for you. Over time, you make friends with your data, learning what queries take you right to the error you want to see, and what graphs reassure you that your software is serving users well. You build up alerts based on those errors. You set business goals as SLOs around those graphs.

OpenTelemetry and Grafana Labs: What's new and what's next

A new year is a natural time to reflect on past achievements — and consider future aspirations. When I think about the observability space, specifically, in 2023, OpenTelemetry felt omnipresent. It was a hot topic at every industry event, with at least one dedicated talk at ObservabilityCON, Monitorama, PromCon, and KubeCon + CloudNativeCon, both NA and EU. A notable highlight from KubeCon was OpenTelemetry going GA, marking a significant milestone in the project’s development.

OpenTelemetry vs Prometheus Detailed Comparison

Both OpenTelemetry and Prometheus are open-source projects under the Cloud Native Computing Foundation. OpenTelemetry is a more comprehensive observability framework with support for metrics, traces, and logs. In contrast, Prometheus is focused specifically on time-series metrics. OpenTelemetry is more versatile, and if you’re confused between choosing between the two, go for OpenTelemetry. We will delve deeper into the reason for choosing OpenTelemetry over Prometheus in this article.

Observability with OpenTelemetry and Checkly

Observability isn't just a buzzword; it's a vital compass guiding us through the maze of system health and performance. As we’ve adopted microservice architectures, the ability to know ‘what is currently happening in our system’ has diminished as our operational resilience has increased. We find services scattered among a maze of interconnections and interdependencies. And even the logs that used to guide are now scattered throughout this maze.

RabbitMQ monitoring with OpenTelemetry

More about SigNoz: SigNoz - Monitor your applications and troubleshoot problems in your deployed applications, an open-source alternative to DataDog, New Relic, etc. Backed by Y Combinator. SigNoz helps developers monitor applications and troubleshoot problems in their deployed applications. SigNoz uses distributed tracing to gain visibility into your software stack. If you need any clarification or find something missing, feel free to raise a GitHub issue with the label documentation or reach out to us at the community slack channel.

Understanding Flame Graphs for Visualizing Distributed Tracing

In the ever-evolving world of software development, one constant remains - the pursuit of better performance. As applications grow in complexity and demand, the need for tools to uncover performance bottlenecks becomes paramount. Flamegraphs, a brainchild of Brendan Gregg, has emerged as an important visualization of insight, showing those dark corners of your codebase that need optimization.

OpenTelemetry Collector - architecture and configuration guide

OpenTelemetry Collector is a stand-alone service provided by OpenTelemetry. It can be used as a telemetry-processing system with a lot of flexible configurations to collect and manage telemetry data. Let's do a deep dive on OpenTelemetry Collectors to understand how it works. The first step in setting up observability with OpenTelemetry is instrumentation. The application code is instrumented with OpenTelemetry client libraries that help generate telemetry data like logs, metrics, and traces.

Accelerate TraceQL queries at scale with dedicated attribute columns in Grafana Tempo

With Grafana Tempo 2.3, we introduced a new storage format (vParquet3), which enabled an exciting new feature (dedicated attribute columns) that focused on the read path. Dedicated attribute columns offer a wide range of benefits primarily centered around query performance and memory usage. These columns can improve read speed across most queries, and they can have a major impact on resource utilization.

OTel Applications on Retrace

We are excited to inform you that Open Telemetry is now available for you with the introduction of “Netreo OTel Appliance”. With the OTel Appliance, cloud-native services like AWS Lambda, AWS ECS, AWS EKS, Azure Functions, Azure App Services, Azure Container Instances, and Azure Kubernetes Services can be monitored and you see application traces and logs in Retrace UI (s1.stackify.com). The applications hosted in the cloud Serverless and containers can be monitored without running the Retrace agent within the instance itself.

Decoding PostgreSQL Monitoring | 101 Guide

Monitoring PostgreSQL for performance issues is critical. PostgreSQL is a powerful open-source relational database system that stands out for its robustness, scalability, and strong emphasis on extensibility and standards compliance. In this guide on PostgreSQL monitoring, we will cover key PostgreSQL metrics that should be monitored, best practices for monitoring PostgreSQL and some tools with which you can set up PostgreSQL monitoring.