Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Stop the guesswork: Troubleshoot with confidence with process monitoring

If your organization runs on tech, everyday issues can be expected. This includes application downtime, erratic connectivity, and failures in remote access, database reachability, site-to-site VPNs, and web-based services. But how do you know if an issue is caused by: Sysadmins usually learn the root cause of an issue after a ticket comes in from the team or customer.

AI Isn't Here to Replace Your Dashboard... Yet

Non-deterministic UIs are the future and will replace your dashboards, but they’re not here yet. So until then, we’re stuck with conversational interfaces. In an effort to try and describe what I consider the future of UIs to look like, I wrote about how you (and I) have been designing dashboards wrong. The core insight was that we've been designing for static representations of data that sit on a TV in the office, when the actual use case is someone at a desk using them to debug an issue.

Architecture for the agentic era: How AI will reshape data, security, and observability

As AI agents move from copilots to autonomous systems, they’re generating and consuming data at unprecedented scale. The result is a new kind of infrastructure pressure — one that’s quietly reshaping how organizations think about data, cost, and control. Across IT, Security, and Observability, leaders are realizing a hard truth: too much data is too costly.

Lightweight Open-Source APM with OTel Demo (Grafana OpenTelemetry Community Call)

We’re back with the second Grafana OpenTelemetry Community Call! Join us as we continue exploring how to get observability into your apps and infrastructure with Grafana, powered by OpenTelemetry. In this session, we’ll walk through the basics of application monitoring using the OpenTelemetry Demo — a realistic example of a distributed system built on a fully open-source stack: Prometheus, Jaeger, and OpenSearch, with dashboards powered by Grafana.

How to Choose the Best Synthetic Monitoring Solutions & Software

To have a fast and reliable experience digitally you would need to do more than resolving issues. This is why people prefer synthetic monitoring which simulates real user actions with regular intervals. Using this method, businesses can detect performance shortcomings and any technical issues. From testing website load to full flow checkout, everything can be tested before users face any issues.

AI for Good: Securing Networks in the Age of Autonomous Attacks

The rise of autonomous AI attacks operating at machine speed demands that network security evolve beyond human capacity and manual processes. Kentik AI Advisor counters this threat by using AI for good, reasoning across full network context to proactively eliminate vulnerabilities and guide immediate, confident defense.

AI Workload Infrastructure Requirements: What You Actually Need

Artificial intelligence (AI) infrastructure requires four pillars working in tandem as a system (compute, storage, networking, and orchestration) tailored to your actual workload needs, not hype. Artificial intelligence (AI) infrastructure isn’t just more hardware. It’s a new class of system—highly distributed, resource-intensive, and tightly coupled across compute, storage, and network layers.

AI Monitoring, Explained: Challenges, Core Components, and Why Observability Is the Next Step

Monitoring AI systems isn’t business as usual. Monitoring AI isn’t like monitoring traditional systems. You can’t just track uptime or response times and call it a day. AI models evolve, data shifts, and behavior drifts over time, which means your monitoring has to evolve, too. If you’re running AI workloads in production, you already know this. Your models might look healthy according to your infrastructure metrics, but they’re still making bad predictions.