Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Live in London: Adoption & AI Confessions @ Nexthink Experience

Tim and Tom are back with another special DEX Show Live!—recorded last week at Nexthink Experience London at the Intercontinental by the iconic O2. Day 1 of the world's biggest DEX event saw over a thousand IT pros gather for two days of innovation, insight, and energy. In this lively episode, the hosts are joined by Guillaume Charles, Senior Director of Product Management (Diagnostics) at Nexthink, and Gabriela Moraes from Electrolux to explore the state of digital and AI adoption in the enterprise.

Downtime on the Docket: The Death Sentence for Productivity in Legal Firms

When minutes matter, IT leaders need more than quick fixes; they need foresight. That’s where Teneo’s Managed DEX (Digital Experience Monitoring) comes in. Managed DEX is designed to detect what legal teams can’t afford to miss. It monitors for “ghost traffic”- those eerie, unexplained signals of abnormal network activity that often signal compromise or instability- and other anomalous device behaviors that can precede full-blown outages or cyber incidents.

SRE Report Retrospectives - Have AIOps Predictions Held Up?

Welcome to a new blog series where we take a candid look at the predictions, insights, and bold claims we've made in previous SRE Reports and ask the uncomfortable question: How did we do? For the uninitiated, Catchpoint's SRE Report is our annual, practitioner-driven effort to capture the pulse of the global reliability community.

Announcing Honeycomb for Frontend Observability React Native Beta

React Native apps straddle two worlds: JavaScript powering your UI and native modules running underneath. Add in backend services, and when something goes wrong, there are many possible culprits. Was it JS logic, the native bridge, the native API call, or a downstream API call? Most tools give you parts of the picture. A crash tool can tell you where the app failed but not what else happened in a session.

Get Third-Party Outage Alerts in Discord with StatusGator

When SaaS tools go down, teams need fast, reliable alerts right where they communicate. Now, with the StatusGator integration for Discord, you can receive real-time third-party outage alerts directly in your server. Whether you’re monitoring the status of AWS, Slack, GitHub, or Google Workspace, StatusGator keeps your team informed instantly when disruptions happen.

Pastries with SREs: Leveling up observability and donut dunkability

In this episode of Pastries with SREs, we explore what it really means to shift left with observability, moving from reactive firefighting to proactive performance. And yes, it starts with donuts. We unpack how SREs and IT Ops teams are often stuck reacting to incidents, battling alert fatigue and swivel-chair triaging. But what if you could pull in developers earlier, and give everyone a unified view of observability data?

Observability-as-Code: Bring synthetic monitoring into your pipeline

Your team just deployed to production. The infrastructure spun up in 90 seconds, but recreating your monitoring? That’ll take hours. It’s added late in the process, managed through dashboards, and prone to inconsistency. Short-term, this slows delivery and creates visibility gaps that surface only during incidents. Long-term, it leaves a business-critical capability out of your observability pipeline.

The observability maturity curve: How IT leaders are shifting from tools to outcomes

Observability has come a long way from its origins in monitoring logs and metrics. Today, it sits on a maturity curve: Organizations move from fragmented tool stacks to unified platforms to proactive engineering practices that tie reliability to business outcomes. To better understand where IT leaders are on this curve, Grafana Labs surveyed 150 decision-makers across industries in advance of ObservabilityCON 2025.