Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

VictoriaMetrics Virtual Meetup Q1 2026 - VictoriaMetrics Updates

VictoriaMetrics continues to enhance usability and developer experience with new built-in capabilities. A lightweight UI now provides clear client setup instructions, simplifying onboarding, while an integrated inspector offers powerful debugging tools directly within the platform. Default tenant configuration further streamlines initial setup, reducing friction for new deployments. In addition, the MCP Server is now included by default in VictoriaMetrics Cloud deployments, eliminating the need for manual installation and making advanced monitoring workflows more accessible out of the box.

The product signal latency gap slowing your growth

Organizations often call product managers the CEOs of the product. But PMs know that’s a myth. When a CEO wants a status report, they get one immediately. They don’t need to negotiate for engineering time, reconcile conflicting project priorities, or wait for a data scientist to find a gap in their schedule. For most PMs, simply understanding the state of the product is where growth can stall.

Test network paths with TCP, UDP, and ICMP in Datadog

When developers and SREs design application tests, they often prioritize user workflows and API availability. Extending that suite with network tests that match your app’s traffic protocols can reveal whether issues originate in the network or application layer. In this post, we’ll explore how you can design effective network tests using the Transmission Control Protocol (TCP), User Datagram Protocol (UDP), or Internet Control Message Protocol (ICMP), including.

Announcing Icinga 2.16.0 and 2.15.3

We are happy to announce the release of two new versions of Icinga 2 today, 2.16.0 and 2.15.3. The first one includes some new features highlighted below, as well as a number of bug fixes and other improvements. The latter one is a small bug fix release that brings some of the other fixes included in 2.16.0 to the 2.15.x branch as well.

What Is Wrong With PaaS Today?

In the wake of 2010s, PaaS felt like magic. You focused on the code, and the platform did the rest. You could ship a production app without knowing anything about networking or, heck, even what a load balancer is. Heroku in particular made deployment a lost thought, especially for early-stage companies. That era is somewhat over, not because platforms got worse overnight, but because the assumptions underneath them quietly stopped being true.

ActiveMQ Dead Letter Queue (DLQ) Management: The Complete Guide

If your Apache ActiveMQ deployment has a growing ActiveMQ.DLQ, you are not alone, and you are looking at the right problem. An unbounded, unmonitored dead letter queue is one of the most common root causes of "invisible" message loss in enterprise messaging environments. DLQ messages land without fanfare, nobody notices, and business-critical data quietly disappears from the processing pipeline.

Apache ActiveMQ vs Apache Artemis: The 2026 Definitive Guide

When engineers search for "Apache ActiveMQ vs Apache Artemis," most of what they find is either a shallow feature checklist or a confident recommendation to "just migrate to Apache Artemis." Neither helps a senior architect deciding whether to stay on a stable, battle-hardened Apache ActiveMQ deployment, or a platform team evaluating both options for a new system with clear eyes.

Setting the Bar for Agentic NetOps

AI has quickly become part of the language of network observability. Many vendors across the observability landscape can describe, summarize, correlate, or explain some data or situation, leveraging basic LLM capabilities. At a distance, many of these offerings sound similar. They promise faster insight, efficient operations, and a more intelligent path through rising complexity. But the industry has reached a point where surface-level similarity is creating noise, not value.

Managing OpenTelemetry Semantic Convention Migrations With the Collector

Real production data tells the story better than I can. Juraci Paixão Kröhling, a friend and fellow observability practitioner at OllyGarden, recently shared an example from an anonymized production environment: 1,830 occurrences of http.url and 23,984 occurrences of url.full in the same dataset. Both attributes describe the same thing. Both are actively being written to the same backend at the same time.