Operations | Monitoring | ITSM | DevOps | Cloud

Observability

The latest News and Information on Observabilty for complex systems and related technologies.

What Do Developers Need to Know About Kubernetes, Anyway?

Stop me if you’ve heard this one before: you just pushed and deployed your latest change to production, and it’s rolling out to your Kubernetes cluster. You sip your coffee as you wrap up some documentation when a ping in the ops channel catches your eye—a sales engineer is complaining that the demo environment is slow. Probably nothing to worry about, not like your changes had anything to do with that… but, minutes later, more alerts start to fire off.

Resolve issues faster with Grafana Cloud Application Observability

Grafana Cloud Application Observability provides an out-of-the box experience to monitor application performance and minimize MTTR. With its native support of the open standards OpenTelemetry and Prometheus, Application Observability unifies signals across the full stack, accelerating root cause analysis while removing proprietary formats and vendor lock-in. Watch this demo of how to use Application Observability in Grafana Cloud.

Zero-code application observability with Grafana Beyla and eBPF: demo

The eBPF-based OSS auto-instrumentation tool Grafana Beyla makes it easier to get started with application observability. Beyla provides RED (Rate, Errors, Duration) metrics through OpenTelemetry or Prometheus for your existing web services, whichever language they are written in. You don’t need to change any line of application code or configuration; you only need to deploy the Beyla in the same host as the service that you want to monitor. Collecting monitoring data with the eBPF autoinstrument tool has very low overhead, and allows you to capture data about your runtime, which is impossible with manual code instrumentation. Watch this in-depth demo of how to use Grafana Beyla to get started with application observability.

Control Prometheus cardinality and metrics cost with Adaptive Metrics

Adaptive Metrics is a cost management feature in Grafana Cloud that helps enterprises control Prometheus cardinality and reduce their observability spend by identifying and eliminating unused metrics. Grafana Cloud customers using Adaptive Metrics see 20-50% reduction in their observability bill.

How Mercado Libre scales its AWS microservices without losing visibility

Learn how Mercado Libre acts more quickly, strategically, and proactively thanks to Datadog’s centralized platform and context-rich alerting.Mercado Libre hosts the largest online commerce and payments ecosystem in Latin America, which means thousands of dollars can be lost if some of their critical applications stop working for even 1 minute. Senior Technical Manager Juliano Martins and software expert Marcelo Quadros share a few reasons why they chose Datadog as their observability platform of choice for their AWS environment: the power of our infrastructure monitoring solution, extensive range of integrations, strong reputation in the market, and more.

How observability and AIOps work better together

If you’re juggling complex, cloud-based, containerized systems and aiming to meet high customer expectations, your old monitoring processes probably don’t cut it anymore. Increasing infrastructure complexity means you need to instrument more, log more, and monitor more. That leads to even more complexity. The answer is better observability, right? Yes and no. Observability and monitoring are critical, but they are only part of what you need for service awareness and availability.

Simplify OpenTelemetry Pipelines with Headers Setter

In telemetry jargon, a pipeline is a directed acyclic graph (DAG) of nodes that carry emitted signals from an application to a backend. In an OpenTelemetry Collector, a pipeline is a set of receivers that collect signals, runs them through processors, and then emits them through configured exporters. This blog post hopes to simplify both types of pipelines by using an OpenTelemetry extension called the Headers Setter.

Observability Shifts Right

Observability first emerged as a focal point of interest in the DevOps community in the 2017 time frame. Aware that business was demanding highly adaptable digital environments, DevOps professionals realised that high adaptability required a new approach to IT architecture. Whereas historically, digital stacks were monolithic or, at best, coarsely grained, the new stacks would have to be highly modular, dynamic, ephemeral at the component level, and spread over multiple cloud-based services.

Tackling Staffing, Funding, and Data Challenges Head-On with TAQA

Join Ed Bailey and TAQA Group's Andrew Ochse as they discuss the diverse services that TAQA offers, look at the challenges with scaling and staffing, and explore in great detail the solutions to classic problems such as insufficient funding, poor data quality, and slow connections linking global sites to their Security Operations Center (SOC).