Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Introducing System Datasets: Observing the Observability Platform

Modern observability platforms are great at explaining what’s happening in your apps and your infrastructure. However, all too often the observability platform itself remains a black box. As observability data and usage grow, governance almost always lags behind, and teams struggle to answer basic operational questions like: This valuable data is typically fragmented across admin UIs, billing pages, support tickets, and tribal knowledge.

GenAI Observability in Grafana Cloud: End-to-End Agent Debugging (Demo)

From Observability for GenAI Applications (Grafana OpenTelemetry Community Call) We drill into traces to see which agents called which tools, where errors occurred, how long each LLM call took, and how costs and tokens are distributed. The walkthrough also covers using AI assistance to summarize long traces and identify optimization opportunities in real time..

API Observability: Why Outside-In Signals Are Still Essential

API observability has become a go-to goal for modern engineering teams. As architectures shift to microservices and APIs become the backbone of products, teams need a reliable way to understand what’s happening across services, before issues turn into incidents. That’s where observability comes in: collect the right signals, connect the dots, and debug faster.

SQL performance improvements: automatic detection & regression testing (part 3)

This is the final part of our 3-part series on SQL performance improvements. In part 1, we covered how to identify slow queries. In part 2, we explored how to fix them with indexes. In this post, we'll share how we prevent those performance issues from ever reaching production again. A few weeks ago, we massively improved the performance of the dashboard & website by optimizing our SQL queries.

Monitor groups are now supported in the API

We recently launched monitor groups, making it easier to organize monitors on your boards and status pages. Now that same functionality is available in the StatusGator API, so you can manage monitor groups programmatically. The API now supports listing, creating, updating, and deleting monitor groups on a board. You can also assign or remove monitors from groups when creating or updating a monitor.

Best DNS Monitoring Tools in 2026

DNS monitoring is the practice of continuously checking that your domain names resolve correctly (right records, right answers) and that DNS lookups are fast and reliable from multiple locations. Depending on the tool, it can also watch for unexpected DNS record changes (A/AAAA/CNAME/MX/NS/TXT, etc.), validate DNSSEC, and pinpoint where resolution breaks in the chain.

API Uptime Monitoring Explained: How to Measure True API Availability in Production

For many teams, API uptime monitoring still means one simple thing: checking whether an endpoint responds with a 200 OK. If the check passes, the API is marked as “up.” If it fails, an alert is triggered. On paper, that sounds reasonable. In practice, it’s one of the most common reasons API outages go unnoticed until users complain. The problem is that modern APIs are no longer simple, stateless endpoints.