Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Your AI App Is Lying to You - Here's How to Fix That #devops #observability #programming

You shipped your AI app. But do you have all the answers? Do you actually know which model ran, how many tokens it consumed, or why it stopped? This is what LLM observability gives you, and most AI engineers are skipping it entirely. I built an SOS detection app and used OpenTelemetry to get full visibility into every single call. Token usage, model version, finish reason, and cost per call all in one place, standardised across any provider. Check out the OpenTelemetry GenAI docs in the link below; there is a lot more you can track than you think.

How to generate real-world load tests using Grafana Cloud k6 and production telemetry

For many development teams, a load test starts with a set of assumptions. You pick 100 virtual users because it sounds reasonable. You ramp for 30 seconds because that's what the tutorial showed. You set a 500ms threshold because it feels like a good target. The test passes, you ship the release, and production falls over at 6 p.m. on a Tuesday because your synthetic load never resembled how real users interact with your application.

Autonomous Error Remediation in Cursor with Lightrun MCP

Lightrun's Gidi Freud demonstrates how your AI coding agent can now investigate and fix production errors, autonomously. Watch how Cursor, guided by Lightrun's Error Remediation skill, picks up a Sentry error, instruments the live service with a runtime snapshot, captures real evidence, and opens a validated PR for approval.

May 2026 product updates

We’ve been busy shipping new features and enhancements to help you monitor critical services more effectively, investigate incidents faster, and customize your StatusGator experience. This month’s updates include historical outage reports, our new Datadog integration, expanded monitoring coverage in Asia Pacific, improved email branding options, and performance upgrades for monitor metrics. We also crossed a major milestone with more than 8,000 services now monitored by StatusGator.

IBM Think 2026 Infrastructure Insights for IT Leaders

IBM Think 2026 made one thing clear: infrastructure leaders are being asked to support more AI, more automation, and faster decision-making without adding unnecessary complexity or risk. Held earlier this month in Boston, IBM Think 2026 focused heavily on enterprise AI, hybrid cloud, automation, governance, and operational transformation.

DataPrime at ingest (DPXL): See the impact of any routing decision

TCO policies have always been one of the most impactful cost levers in Coralogix. Route business-critical data to High, push monitoring data to Medium, archive compliance logs to Low. With the addition of DataPrime expressions (DPXL) – a subset of the DataPrime query language designed for inline filtering at ingest – that routing became even more precise, matching on any field in the event payload, not just application, subsystem, and severity.

Federated Search | From Silos to Insight | Azure Blob Schema Discovery with Splunk's Crawler

This walk-through shows how Splunk's Cloud can discover schema and partition keys for Microsoft Azure Blob Storage datasets and create searchable Splunk managed tables. Once the data is mapped, analysts can use Splunk Federated Search to query Azure Blob data where it lives, bringing cloud-resident logs into security, observability, and operational work-flows without re-ingesting the data.

The Observability Journey: Getty Images and Cribl

I recently sat down with Simon Overbey and Lovepreet Singh - the Engineering Manager and systems engineer (respectively) at Getty Images to talk about their experiences implementing Cribl. After getting a rundown of the pre-Cribl environment (described above) I asked to jump straight to the end, the net benefits. If the "before" was a terrifying tidal wave of cost and complexity, what did the "after" look like?