Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

10 Best Practices for Proactive Database Performance Monitoring to Prevent Downtime

Databases are the core of modern applications, whether it is an e-commerce platform, a banking system, or a social media app. Slow database performance or unexpected downtime can cause serious problems, from lost revenue to poor customer experience. Proactive database performance monitoring helps teams identify issues before they escalate. Unlike reactive monitoring, which only addresses problems after they occur, proactive monitoring ensures your database remains fast, stable, and reliable.

Node.js Event Loop: Why Monitoring Matters

Node.js has become a cornerstone for modern application development because of its non-blocking and asynchronous architecture. According to Stack Overflow Developer Survey, Node.js remains among the most widely used technologies for web applications, powering millions of services globally. While this event-driven model provides scalability and efficiency, it also introduces challenges.

InfluxDB 3 Enterprise: Deploy Your Way, Scale on Demand

InfluxDB 3 Enterprise is engineered for performance and designed for flexibility, delivering high-scale, production-ready time series data management with operational simplicity. InfluxDB 3 Enterprise is built on a cloud-native, diskless architecture that removes the limits of traditional storage. It’s easy to deploy, scales effortlessly, and eliminates the complexity of managing clusters so you can deploy your way and meet the unique demands of your environment.

Automate Your Infrastructure Analysis with Scheduled AI Reports

The least exciting part of an operations or SRE role is often the manual, repetitive task of generating reports. It’s the Monday morning scramble to summarize weekly infrastructure health for the team, or the end-of-quarter push to build a capacity planning document. This is boilerplate work that pulls you away from critical engineering tasks. We believe that if a process is repeatable, it should be automated. That’s why we’re introducing Scheduled AI Investigations and Insights.

How to perform real-time DNS monitoring in Grafana Cloud

When DNS or domain name server resolution processes fail, or become sluggish, users can experience timeouts, connection errors, and degraded performance — often without clear indication of the root cause. This is where DNS checks in Grafana Cloud Synthetic Monitoring come in, allowing you proactively monitor domain name resolution, verify that domains resolve to the correct IP address, and even measure how quickly that resolution occurs.

ICMP Monitoring: What Is ICMP & How It Works

Ever “pinged” a server and wondered what those milliseconds actually mean? If you’re a network admin or IT pro, you already use ping as a quick sniff test. But ICMP is more than a green checkmark or a scary timeout. In this article, we’ll define ICMP, walk through how echo requests and replies work, and show how to turn basic pings into useful network and ICMP monitoring.

OpenTelemetry and Jaeger | Key Features & Differences [2025]

OpenTelemetry is a broader, vendor-neutral framework for generating and collecting telemetry data (logs, metrics, traces), offering flexible backend integration. Jaeger, on the other hand, is focused on distributed tracing in microservices. Earlier Jaeger had its own SDKs based on OpenTracing APIs for instrumenting applications, but now Jaeger recommends using OpenTelemetry instrumentation and SDKs. Warning The original Jaeger client SDKs (based on OpenTracing) are archived and no longer maintained.

From Shadow AI to Strategy: The Six-Month AI Imperative (w/ Charlene Li)

In this very special episode of The DEX Show, we welcome back one of the world’s most influential voices on digital transformation and the future of AI leadership: Charlene Li. Charlene is a bestselling author and trailblazing thinker who has helped leaders navigate disruption for over two decades. She returns to the show for an unmissable conversation on the realities of AI Transformation—and what it means for organizations, leaders, and employees at every level.