Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on APIs, Mobile, AI, Machine Learning, IoT, Open Source and more!

API Health Monitoring Explained: How to Detect Silent Failures That Health Checks Miss

APIs sit at the center of modern digital systems. They power mobile apps, enable partner integrations, and connect internal services across distributed architectures. When an API fails, the impact is immediate: broken user journeys, stalled transactions, and downstream systems that quietly stop working. That’s why API health monitoring is now a core reliability practice for modern engineering teams. The problem is that “API health” is often defined too narrowly.

Observability for GenAI Applications (Grafana OpenTelemetry Community Call)

In this episode, we’re diving into observability for Generative AI apps. AI helps us write code and monitor applications in production - but how do we observe the AI itself? And how do we make sense of complex, non-deterministic AI systems? We’re joined by two great guests: Ishan Jain, working on GenAI observability and Luccas Quadros, working on Grafana Assistant. Together, they bring both platform-level insights and real-world perspectives.

What is ServiceNow's AI Control Tower?

What happens when AI agents stop being scattered and start being steered? Customer service queues shrink, teams get time back for high-value work, and everyone finally works off the same data. That’s the power of the ServiceNow AI Control Tower—all your AI, all under control. No more fragmentation. No more busywork. Just visibility, control, and workflows that scale across the entire business.

Elephant in the Room, Episode 3: Building a CFP Review Platform with PostgreSQL & Django (Live)

In Episode 3 of Elephant in the Room, we move from theory to practice, building a real, community-focused application live on stream. Join Jay Miller (Aiven), Abigail Dogbe, and Andres Pineda as they collaboratively design and build a CFP (Call for Proposals) review platform using PostgreSQL and Django. The goal: create a practical tool that helps speakers get better feedback on their talks, and helps organizers discover new, diverse voices.

GitKraken Insights | Engineering Intelligence in Minutes

Most software intelligence tools take months to implement, cost a fortune, and end up collecting dust. GitKraken Insights is different. It helps engineering leaders measure what matters: AI impact, code quality, delivery performance, and developer experience, all in one place. It’s the latest evolution of the GitKraken DevEx platform, trusted by over 40 million developers. Insights connects data from across your GitKraken tools to give you a complete picture of engineering health and value. We're talking DORA metrics, pull request metrics, and AI impact.

MCP: Why AI Needs Git Intelligence

GitKraken CTO Eric Amodio breaks down the Model Context Protocol (MCP) and explains why Git intelligence is critical for AI agents at GitKon 2025. In this session, Eric covers: What MCP is and why every major AI company adopted it Why AI needs Git history, not just file system access How GitKraken MCP removes Git pain safely The future of agentic developer workflows How Commit Composer uses AI to organize commits without losing data.

How to Troubleshoot BGP Faster with Kentik AI Advisor

A BGP session goes down because a transit provider exceeded the maximum prefix limit. How do you find the root cause — fast? In this 10-minute demo, we walk through two approaches using Kentik AI Advisor. First, we troubleshoot step by step using natural language: asking AI Advisor to identify the affected interface, check for interface flapping, and review syslog messages until we find the maximum-prefix violation. Then we show how custom network context and natural language runbooks let AI Advisor do the entire investigation autonomously — following the same four steps a senior engineer would.

ChatOps that actually works: Grafana Cloud, Slack, and AI-powered observability

Context switching isn’t just inefficient—under pressure, it’s exhausting. It slows decision-making, increases the risk of mistakes, and makes even experienced engineers feel like they’re always a step behind the system they’re responsible for. At Grafana Labs, we want to build tools that meet you where you are. That's why we embedded Grafana Assistant, our context-aware AI assistant, directly in Grafana Cloud.

Measuring Claude Code ROI and Adoption in Honeycomb

At Honeycomb, we’ve been using Claude Code across our engineering team for a while. Anecdotally, I had a sense of who the power users were, and I had seen some examples of complex usage. But I wanted to be able to confidently answer questions, like: Claude Code supports OpenTelemetry out of the box, which means sending telemetry to Honeycomb takes just a few minutes of configuration.