Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Easiest Way to Ship Docker & Nginx Logs to Loki with Promtail

Effective monitoring catches problems before users do, and with Promtail, Loki, and LogQL, it’s a lightweight, approachable option for any DevOps team. This guide shows how to monitor Docker itself (pull failures, restarts, health flaps) so you’ve got a baseline on container runtime health.

How to check CPU usage on Linux

When your Linux system feels sluggish, one of the first things to investigate is the CPU usage. The CPU (Central Processing Unit) is the brain of your machine, and if it’s overloaded, everything else slows down. In this guide, you’ll learn different ways to Linux check CPU usage with command-line tools, how to interpret the metrics, and why automatic monitoring with Icinga ensures long-term system stability.

OTel Naming Best Practices for Spans, Attributes, and Metrics

An incident’s in progress. Services are slow, customers are frustrated, and your dashboards… look fine. At least, until you search for payment metrics and get 47 different names for the same signal. Suddenly, the real issue isn’t latency — it’s inconsistency. The OpenTelemetry project recently published a three-part series on naming conventions to solve exactly this problem.

Why 1% Packet Loss Is the New 100% Outage

For years, you had an unspoken agreement. Your networks were built to be resilient, and your applications were, for the most part, forgiving. You sent emails, transferred files, and backed up data. If a few packets went missing along the way, the protocols would quietly clean up the mess. A little bit of packet loss was just background noise, an expected imperfection in a system that was, by and large, incredibly robust. You could tolerate it.
Sponsored Post

7 Downdetector Alternatives

Downdetector is one of the best-known outage-tracking platforms, but its consumer-first approach has limitations for technical teams. Its reliance on user-submitted incident reports makes it prone to noise, false positives, and incomplete coverage of B2B and cloud-specific services. That's why we're exploring the best Downdetector alternatives available today, and highlighting which ones work best for businesses.
Sponsored Post

Innovating Security with Managed Detection & Response (MDR) and ChaosSearch

Managed Detection and Response (MDR) services occupy an important niche in the cybersecurity industry, supporting SMBs and enterprise organizations with managed security monitoring and threat detection, proactive threat hunting, and incident response capabilities. In this week's blog, we're taking a closer look at the role of MDRs in cybersecurity, the biggest challenges they face, and how integrating ChaosSearch is helping MDRs manage complexity, reduce data retention costs, and enable long-term security analytics use cases that are critical for customer success.