
How to Use an AI Assistant with Your Monitoring System - VictoriaMetrics MCP Server

Alex Marshalov explores the new VictoriaMetrics MCP Server. He moves beyond the hype to show what's truly possible today. The presentation offers a builder's perspective on integrating AI with time-series data, featuring a demo that showcases both the potential and the current realities (yes, there are some). See how we're thinking about solving complex monitoring challenges with AI.

Beyond the code: Shipping faster with AI with Leo P.

We’re running a short mini-series on The Debrief podcast called Beyond the code, where we interview our engineers about what it’s really like to build at incident.io. In this episode, we chat with Product Engineer Leo about how we’re using AI tools like Claude Code to ship more product, more quickly.

Introducing AI Agent Monitoring

AI is changing how we build software, but debugging code still comes down to having context. One minute the model’s performance is cruising. The next, you’re hit with a KeyError from a tool you forgot existed, triggered by a model that silently timed out, and a retrieval call that returns... nothing, or 11 “Let me try this a different way” messages before failure. You’re stitching together LLM calls, agents, vector stores, and custom logic. Then hoping it holds up in prod.

Introducing AI Agent Monitoring in Sentry

Monitoring agents and LLM applications is... different. You're managing everything from tool calls to model configurations and token usage, and AI systems do their best to solve problems on their own, so errors aren't always clear. Sentry's agent monitoring focuses on making it easy to dive into your AI applications and understand what's breaking and where, so you can fix it faster.

Engineering Excellence in the Age of AI: It's Not Dead, It's Maturing

On a recent episode of The Product Manager podcast, Cortex CEO Anish Dhar joined host Hannah Clark to challenge a growing narrative: that software engineering is obsolete in the age of AI. His take? Engineering isn’t disappearing, it’s maturing. At Cortex, we work with some of the most forward-thinking engineering organizations at companies like Canva and Fanatics.

Introducing Cause Analysis: Instant Triage for Traffic Changes with Kentik AI

Introducing Cause Analysis from Kentik, designed to simplify network traffic analysis and rapidly identify the root cause of issues. Learn how this exciting new feature streamlines troubleshooting, makes complex insights accessible, and boosts team efficiency for all users.

GPU Powerhouse: Scaling an AI Cloud in the Heart of Europe

The AI revolution needs more than models - it needs massive infrastructure. And Julien Gauthier is building it. In this episode of Uplink, Julien, CEO of Arkane Cloud, joins host Michael Reid to unpack how his company scaled from 3D rendering and gaming to delivering GPU cloud services for AI workloads across the globe. We explore how Arkane built a 1,000-GPU cluster in Paris (with capacity for 6,000), the rise of inference workloads in Europe, and the real-world engineering and business challenges of deploying high-density infrastructure - including cutting-edge liquid cooling handling 135kW per cabinet.

How Sentry's Seer AI Agent passes legal review: a guide for legal teams reviewing Seer

If your legal department is anything like ours, you’re being inundated with requests from the business to use more and more AI tools. From developers wanting to use coding agents like Cursor, to security implementing AI-driven investigations, to sales and marketing leveraging AI for call insights and competitive research, we've seen a shift in what teams are trying and buying.

Puppet Infra Assistant: AI-Powered Natural Language Queries

Finding critical infrastructure insights shouldn't be a game of hide-and-seek. The new AI-powered Infra Assistant is a natural language interface that allows users of any skill level to chat with Puppet data and services for quick insights and reporting on infrastructure state. You don't need any Puppet experience to get started; it's safe to use in your infrastructure; and it's secured with explicit opt-in and robust role-based access control.