%term

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Jaeger Metrics: Internal Operations and Service Performance Monitoring

Jul 15, 2025 By Faiz Shaikh In Last9

You're monitoring a microservices-based system. Alerts trigger when response times exceed 2 seconds. But when you open Jaeger, you're faced with thousands of traces. Identifying which service or operation is responsible becomes time-consuming. Jaeger metrics help reduce this friction by exposing aggregated telemetry. Instead of scanning individual traces, you get service-level and operation-level performance metrics, latency, throughput, and error rates that highlight where the issue lies.

Read Post

Last9

Read more about Jaeger Metrics: Internal Operations and Service Performance Monitoring

Arie's Adventures with Coroot

Jul 15, 2025 By Arie Van Den Heuvel In Coroot

Arie van den Heuvel is an engineer, a System and Application Management Specialist, and a valued member of our community. Below he has shared his journey using Coroot, and how it has helped improve observability for his team. You can read more of Arie’s writing and support the resource articles he has created for open source on his blog.

Read Post

Coroot

Read more about Arie's Adventures with Coroot

Identifying Idle Paths in a Data Center Leaf-Spine Fabric

Jul 15, 2025 By Kentik In Kentik

In a perfect leaf-spine network, traffic evenly spreads across all links. But reality is often different, leaving costly, idle paths hidden in your data center fabric. Kentik's Phil Gervasi demonstrates how Kentik's network intelligence platform helps engineers quickly identify and address these underutilized paths. With powerful visualizations, detailed telemetry analysis, and customizable alerts integrated into your ticketing systems, Kentik makes it easy to spot persistent traffic imbalances, troubleshoot ECMP issues, and optimize your infrastructure.

View Video

Kentik

Read more about Identifying Idle Paths in a Data Center Leaf-Spine Fabric

Real-Time Alerting for AI-Optimized Data Centers

Jul 15, 2025 By Phil Gervasi In Kentik

Kentik transforms real-time network telemetry into actionable alerts for AI-optimized data centers. By converting database queries into custom alerts, engineers can detect issues like elephant flows, idle links, and packet loss before performance suffers and triggers alerts in systems like ServiceNow or PagerDuty.

Read Post

Kentik

Read more about Real-Time Alerting for AI-Optimized Data Centers

Atatus APM: Full-Stack Visibility for Modern Engineering Teams 2025

Jul 15, 2025 By Pavithra Parthiban In Atatus

APM stands for Application Performance Monitoring or Application Performance Management. It helps engineering teams track key metrics, detect slowdowns, and improve the overall performance of their applications. With Atatus APM, you get complete visibility into your application, from backend code and databases to external services and frontend performance.

Read Post

Atatus

Read more about Atatus APM: Full-Stack Visibility for Modern Engineering Teams 2025

How to Troubleshoot Outages Faster Using Elastic Observability [2 Min Live Demo]

Jul 15, 2025 By Elastic In Elastic

In this video, I’ll show you how Elastic Observability helps you reduce downtime, accelerate root cause analysis, and unify logs, metrics, and traces in one powerful dashboard. With native OpenTelemetry support, AI-powered troubleshooting, and built-in anomaly detection, you can streamline your workflows and boost service reliability.

View Video

Elastic

Read more about How to Troubleshoot Outages Faster Using Elastic Observability [2 Min Live Demo]

Cloudflare's Resolver Outage: More Than Just DNS

Jul 15, 2025 By Catchpoint Team In Catchpoint

“It’s always DNS.” That’s the running joke in IT. When websites won’t load and apps grind to a halt, DNS—the internet’s address book—is often the first to get blamed. That’s because DNS translates human-friendly names like google.com into IP addresses that computers use to route traffic.

Read Post

Catchpoint

Read more about Cloudflare's Resolver Outage: More Than Just DNS

From Reactive to Proactive: A User-Centric Digital Strategy for Banks

Jul 15, 2025 By Catchpoint In Catchpoint

In today's digital-centric banking environment, financial institutions must be able to provide seamless and reliable application performance across all digital channels - from a branch to a mobile device. Failure to do so results in real impact to customer satisfaction, trust, and loyalty. Modern banking applications are increasingly complex, running off of internet-centric distributed architectures involving many different parties and services. For these modern tech frameworks, traditional APM tools are no longer sufficient to ensure service reliability and optimal customer experience.

View Video

Catchpoint

Read more about From Reactive to Proactive: A User-Centric Digital Strategy for Banks

If your site is slow, it might as well be down.

Jul 15, 2025 By Catchpoint In Catchpoint

It’s no longer enough for a site to just be available; it had to be fast. If the experience lags, your customers will bounce within seconds. The consequences scale fast: business stops and revenue disappears. You need to monitor performance across the full delivery chain because speed is what keeps users engaged.

View Video

Catchpoint

Read more about If your site is slow, it might as well be down.

Smarter Workflows, Faster Insights: How InfluxDB 3 Unlocks the Power of Python at the Source

Jul 15, 2025 By Allyson Boate In InfluxData

Businesses across industries rely on time-stamped data to track system health, monitor performance, and improve operations. Whether it’s sensors on a factory floor or usage logs from a SaaS platform, time series data reveals how things change. As businesses digitize operations and add connected devices, sensors produce growing streams of time-based data. This opens the door to faster analytics and smarter automation. But legacy approaches can’t keep up.

Read Post