Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Application Performance Monitoring and related technologies.

Claude Agent SDK Monitoring & Observability with OpenTelemetry and SigNoz

Learn how to implement monitoring and observability for the Claude Agent SDK using OpenTelemetry and SigNoz. In this video, we walk through instrumenting your Claude-based agents, capturing traces, metrics, and logs, and visualizing everything in SigNoz for real-time insights. You’ll learn how to debug agent behavior, identify latency bottlenecks, and monitor performance in production environments.

Release software with confidence using Datadog Feature Flags

In this technical product demo, see how Datadog Feature Flags helps teams release software with confidence by connecting every feature flag to real-time observability data. Configure progressive, multi-step rollouts with automated guardrails tied to APM, RUM, and Product Analytics so you can pause or roll back instantly if latency, errors, or key business metrics degrade.

Best Rails APM Tools in 2026: A Developer's Guide

Rails applications have a specific set of performance challenges that make monitoring genuinely useful rather than just box-checking. ActiveRecord is convenient to use and also convenient to accidentally write N+1 queries with. Memory bloat in long-running processes, particularly when Sidekiq or Action Cable is involved, is a recurring production problem for a lot of teams. Background job performance tends to degrade quietly until it becomes noticeable.

Datadog Incident Response: One platform from alert to resolution

When incidents strike, speed and clarity are critical. Datadog Incident Response brings the full incident lifecycle into one platform so teams can move from detection to resolution with confidence. Operate from a single, unified view of your systems, coordinate across the tools your teams already use, and leverage AI that analyzes incidents in real time to surface context, guide decisions, and accelerate resolution.
Sponsored Post

The art of software engineering management

Like any leadership role, leading an engineering team in a mature, compact company like Raygun comes with both honor and responsibility. Leading a major development project is a bit like conducting a symphony orchestra, where every individual plays a crucial role and has a great impact on the work they release to customers and end-users.

Cloud Application Slowness: When Every Team Says 'It's Not My Problem'

A retail ERP system underwent a vertical scaling operation to support growth from 3,000 to 10,000 stores on AWS. Immediately following the cutover, users experienced widespread HTTP 503 (“Service Unavailable”) errors and checkout failures. Yet, standard performance dashboards indicated a healthy environment.

Data Observability, AI Guard, Feature Flags, Ambassador program, and more | This Month in Datadog

See how you can ensure trust across the data life cycle in February’s episode of This Month in Datadog. Join us for a spotlight of Datadog Data Observability, which enables you to detect data quality and pipeline issues early, as well as remediate those issues with end-to-end lineage. Plus, we cover: Protecting agentic AI applications from real-time threats with Datadog AI Guard Staying up to date and reducing steps to collaborate with five new Incident Management releases Releasing software with confidence using Datadog Feature Flags.
Sponsored Post

SAP Application Performance Monitoring (APM): Beyond Generic Metrics

Your enterprise APM tool shows SAP is using 90% CPU. The dashboard turns red. An alert fires. Now what? You open Dynatrace. You see the Java Virtual Machine metrics for your NetWeaver stack. You see HTTP response times for your Fiori apps. You see a spike in database calls. None of this tells you why VA01 takes 45 seconds to create a sales order. None of this tells you which custom ABAP report is consuming memory. None of this explains the short dump that crashed your pricing routine. This is the gap between generic APM and true SAP application performance monitoring. Your enterprise tools see the symptoms.

Case Study - Troubleshooting Storage Failures in a VMware ESXi Infrastructure

IT problems happen even in the best architected infrastructure due to configuration changes, failures, upgrades and such. How quickly and effectively you can detect and resolve such problems dictates how efficient your IT operation is. Today, I’ll cover how eG Enterprise helped us troubleshoot a hardware failure (a storage battery failure) that that caused a cascade of failures in a VMware ESXi infrastructure.