
The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Announcing Scout's MCP Server for AI-Native Monitoring!

We’re excited to introduce the Scout Monitoring MCP Server — a new way to bring AI-native monitoring directly into your coding assistant. Instead of flipping between dashboards and logs, the MCP (Model Context Protocol) server surfaces performance data, errors, and slow endpoints right where you work. Ask plain-language questions like “show me the latest five errors” and get answers grounded in live telemetry. You can even let your coding assistant propose and push fixes!

Debugging Microservices in Production with Distributed Tracing

Your production checkout flow just started returning 500 errors. Six microservices handle checkout. Logs show errors in three of them. Which service broke? Which error happened first? What caused the cascade? Traditional debugging doesn't work. You can't attach a debugger to production. Searching logs across six services gives thousands of lines with no obvious connection. By the time you correlate timestamps and trace IDs manually, customers have abandoned their carts.
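
The manual correlation described above gets easier once every service writes the same trace ID into its logs. Below is a minimal sketch of that idea, not any vendor's implementation. It assumes structured log records with hypothetical field names (`trace_id`, `service`, `timestamp`, `level`) and roughly synchronized clocks. Real tracing systems rely on parent/child span relationships rather than wall-clock order.

```python
from collections import defaultdict


def first_error_per_trace(records):
    """Map each trace_id to its earliest ERROR record.

    `records` is an iterable of dicts with the hypothetical keys
    trace_id, service, timestamp (epoch seconds), and level.
    """
    errors_by_trace = defaultdict(list)
    for rec in records:
        if rec["level"] == "ERROR":
            errors_by_trace[rec["trace_id"]].append(rec)
    # The earliest error in a trace is the most likely origin of the cascade.
    return {
        trace_id: min(errors, key=lambda r: r["timestamp"])
        for trace_id, errors in errors_by_trace.items()
    }


# Illustrative records only; these are made-up values, not real logs.
sample_records = [
    {"trace_id": "t1", "service": "inventory", "timestamp": 10.0, "level": "INFO"},
    {"trace_id": "t1", "service": "cart", "timestamp": 10.5, "level": "ERROR"},
    {"trace_id": "t1", "service": "payment", "timestamp": 10.2, "level": "ERROR"},
    {"trace_id": "t1", "service": "checkout", "timestamp": 10.9, "level": "ERROR"},
]

for trace_id, rec in first_error_per_trace(sample_records).items():
    print(trace_id, rec["service"])  # prints: t1 payment
```

With three services logging errors for the same request, the sketch points at the one that failed first instead of leaving the ordering to manual timestamp comparison.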

Cloud Microservices Monitoring on AWS and Azure with OpenTelemetry

Your checkout flow starts in an AWS Lambda function, calls a payment service running on EKS, then triggers notifications through Azure Functions. Three different compute platforms, two cloud providers, one distributed trace that you can't see. Cloud providers want you to use their native monitoring tools. AWS pushes X-Ray and CloudWatch. Azure promotes Application Insights and Azure Monitor. These tools work well within their ecosystems but lock you into vendor-specific implementations.
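
What lets one trace cross Lambda, EKS, and Azure Functions is a vendor-neutral context header passed along with each call. OpenTelemetry's default propagator uses the W3C Trace Context `traceparent` header. The sketch below builds and parses that header by hand, using only the standard library, to show what travels between services. The function names are hypothetical, and in practice an OpenTelemetry SDK would handle this for you.

```python
import re
import secrets

# Version 00 of traceparent: version-traceid-parentid-flags, lowercase hex.
_TRACEPARENT_RE = re.compile(
    r"^([0-9a-f]{2})-([0-9a-f]{32})-([0-9a-f]{16})-([0-9a-f]{2})$"
)


def new_traceparent(sampled=True):
    """Start a new trace: random 16-byte trace ID and 8-byte span ID."""
    flags = "01" if sampled else "00"
    return f"00-{secrets.token_hex(16)}-{secrets.token_hex(8)}-{flags}"


def parse_traceparent(header):
    """Return the header's fields as a dict, or None if it is invalid."""
    match = _TRACEPARENT_RE.match(header.strip())
    if match is None:
        return None
    version, trace_id, parent_id, flags = match.groups()
    # The spec forbids version ff and all-zero trace or parent IDs.
    if version == "ff" or trace_id == "0" * 32 or parent_id == "0" * 16:
        return None
    return {
        "version": version,
        "trace_id": trace_id,
        "parent_id": parent_id,
        "flags": flags,
    }


def child_traceparent(incoming):
    """Header for an outgoing call: same trace ID, fresh span ID."""
    ctx = parse_traceparent(incoming)
    if ctx is None:
        # No valid upstream context, so this service starts a new trace.
        return new_traceparent()
    return f"00-{ctx['trace_id']}-{secrets.token_hex(8)}-{ctx['flags']}"


incoming = "00-4bf92f3577b34da6a3ce929d0e0e4736-00f067aa0ba902b7-01"
outgoing = child_traceparent(incoming)
print(parse_traceparent(outgoing)["trace_id"])
# prints: 4bf92f3577b34da6a3ce929d0e0e4736
```

Because each hop keeps the trace ID and replaces only the span ID, spans from different platforms can be reassembled into one trace by any backend that understands the format.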

Observability - Not Just Dashboards and Alerts | Why Teams Like Uber & Salesforce Use Grafana Cloud

Grafana Cloud is a fully managed observability platform built on open source and open standards. From Fitbits to power grids, it helps teams monitor systems, cut through noise, and act faster. With 150+ integrations, Grafana Cloud unifies logs, metrics, and traces, giving visibility from backend to frontend. AI-powered guidance accelerates root cause analysis and simplifies on-call, while customers like Citigroup, Salesforce, Uber, and ASOS scale with confidence.

Honeycomb Observability Day SF - Kesha Mykhailov, Fin.ai: Human-Centric Observability in AI Systems

Empathy is one of the superpowers of modern teams, especially when building tools that interact with humans. This talk by Kesha Mykhailov tells the story of Fin, Intercom's Customer Support agent, and how they transformed their approach to Fin's.

Sending beers all across Belgium, a throwback to how we named Oh Dear

We're obviously a little biased, but we believe we have one of the best website monitoring tools on the market today, with more features than our competitors. We've already tried a variety of marketing techniques to promote our service, but none really had the impact we were looking for. Maybe we're better at actually building good software than we are at marketing it? Or are we just trying what everyone else is doing, which makes it harder to stand out?

Agentic AIOps in Action: LogicMonitor, IBM, and Red Hat Deliver Self-Healing IT

Your most skilled engineers shouldn’t be spending nights and weekends piecing together the root causes of outages. Yet many organizations still rely on manual incident response across sprawling hybrid and multi-cloud environments. The result: slower resolution times, frustrated customers, and lost revenue that can reach $1 million per hour, according to IDC. At LogicMonitor, we believe the answer isn’t just better monitoring. It is systems that can heal themselves.