Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Internet Speed Monitoring - How to Proactively Test Your Internet Connections

Recent enhancements to eG Enterprise have added functionality to allow you to proactively test your internet speed with synthetic monitoring (“robot” tests that simulate real user activity). Using the new functionality you can proactively monitor internet speeds 24×7 from any location. The performance and quality of an Internet connection plays a major role in any IT environment. Use cases for this new functionality include.

Icinga Installation Guide - Part 1 - Getting started with a base Icinga Installation

Get up and running with Icinga 2 and Icinga Web in this step-by-step installation guide. In this video, we walk you through a complete base installation of Icinga, covering everything from setting up the database to accessing the web interface for the first time. This will help you get to the point of a working installation, especially if you're new to Icinga. We take you through the full process, including installing required components, configuring databases, enabling services, and completing the web setup wizard.

Icinga Installation Guide - Part 2 - Installing Icinga Director and configuring your first objects

Take the next step with Icinga by adding the powerful configuration management tool Icinga Director to your setup. In this second part of our installation guide, we focus on simplifying and scaling your configuration using the Director. You’ll learn how to connect it to your existing Icinga 2 instance, create reusable templates, and start monitoring hosts and services through a more flexible, web-based interface.

Leveraging Cognitive Diversity to Tackle System Complexity

Most engineering leaders today understand that diversity matters. They've built teams that reflect a range of backgrounds, functions, and experience levels. They run postmortems, retrospectives, and architecture reviews that bring multiple voices to the table. They believe, not unreasonably, that this variety of perspectives leads to better decisions. But there's a problem hiding inside that assumption that can undermine everything: who people are is a surprisingly poor predictor of how they think.

Observability Lessons From OpenAI

Writing code is moving from the good old IDE into the realm of autonomous AI agents. One example of this is OpenAI, which has been developing internally with 0 lines of manually written code. You can read about their workflow in their engineering blog: Harness engineering: leveraging Codex in an agent-first world. For me, the main takeaway of OpenAI’s article is how AI has rewritten the constraints equation.

API Error Monitoring: A Complete Guide to Detecting and Resolving API Failures

APIs power nearly every modern digital experience. From mobile apps and SaaS platforms to payment gateways and internal microservices, APIs handle authentication, transactions, content delivery, and system-to-system communication. When an API fails, users often experience broken features, slow responses, or complete service outages. In many cases, they leave before your team even realizes something is wrong. The business impact of API failures is significant.

API Availability Monitoring: How to Measure True API Availability

APIs are no longer just integration layers. They power customer logins, payment processing, SaaS workflows, partner ecosystems, and mobile applications. When an API becomes unavailable, revenue stops, user trust declines, and service level agreements are immediately at risk. Yet many teams still define API availability in the simplest possible way. If an endpoint responds with a 200 OK, the API is considered available. Monitoring dashboards stay green. Alerts remain silent. Everything appears healthy.

Grafana Campfire - Release Pipelines - (Grafana Community Call - March 2026)

In this Campfire Community call, we'll be exploring Grafana's release pipelines - covering both our on-prem (public and private) artifact delivery and our Rolling Release Channels for building Grafana Cloud We'll walk through the fundamentals of how our pipelines work, including how ICs can patch branches and manage their own core Grafana releases, and where we're headed in the future. Plus much more!