Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Improving browser tracing step by step

Browser tracing has always been one of those things that feels invisible until it isn’t. When it works well, you get clear, actionable insights into how your app is performing in the wild. When it doesn’t, you’re left staring at noisy data, gaps in traces, and spans that don’t quite tell the story. Over the last few months, we’ve been chipping away at that problem.

How AI-Native Data Pipelines Accelerate Threat Hunting

Every security incident starts as a small signal: an unusual connection, a mistyped command, or a subtle change in behavior. The challenge is that most of these signals are buried in mountains of noise. Modern SOCs process terabytes of telemetry each day, but only a fraction contains real indicators of compromise. Without intelligent detection and prioritization, valuable warning signs get lost in the flood of low-value events.

Mobile session replay - now live in Coralogix

Coralogix Real User Monitoring (RUM) already gives teams a complete view of how users experience their websites. Now, that same visibility comes to mobile. With Session Replay for iOS and Android, you can watch real sessions unfold and understand exactly what users saw and did, without relying on vague support tickets or incomplete crash logs. Session replay captures exactly how users interact with your mobile app: taps, swipes, scrolls, and screen transitions.

Manage and optimize your OCI costs with Datadog Cloud Cost Management

Engineering teams need to deliver reliable, secure, and high-performing applications, all while keeping costs under control. But engineers often lack visibility into cloud cost data, relying on finance-driven reports that they receive only after the billing cycle closes. Without daily cost insights alongside observability data, they don’t know until it’s too late that an infrastructure change caused a significant cost increase.

Stop decision overload: How discovery filters optimize device onboarding for efficient network monitoring

Every network administrator encounters the same question during discovery scans: Should this device be monitored or ignored? Routers are critical, but what about test servers, lab switches, or that aging and unused printer still on the network? Manually deciding for each device creates decision overload and risks overlooking what really matters.

How to Scale Prometheus APM for Modern Applications

When developers monitor application performance, they pick one of two paths: traditional APM tools with distributed tracing and code profilers, or metrics-driven monitoring with Prometheus. The second approach — Prometheus APM — tracks the signals that matter most: request rates, error rates, latency, and resource utilization. No agents to install, no per-host pricing, just exporters and PromQL. For most teams, Prometheus APM is where monitoring starts.

How image generation models are creating new infrastructure demands for DevOps teams

The rapid adoption of generative AI has moved far beyond research labs and creative studios. Image generation models, in particular, have become critical components in content production pipelines, marketing platforms, design workflows, and enterprise applications. What began as a novel way to create digital art has evolved into a class of workloads that behave very differently from traditional web services.

OpenTelemetry + ignio: The Foundation for Intelligent, Unified Observability

In the previous post, What is OpenTelemetry?, we went over the What, Why, and the How of OpenTelemetry. We also went over the telemetry data lifecycle (data generation à collection à storage à usage) and how telemetry data (MELT) could be put to use to troubleshoot a representative web application scenario.