Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on APIs, Mobile, AI, Machine Learning, IoT, Open Source and more!

Using AI + Rollbar's Session Replay to Understand Complex Errors

Front‑end bugs are notoriously hard to reproduce. By the time an error shows up in your monitoring tool, the most important context is already gone: what the user actually did. Session replay helps—but only if someone has the time and patience to scrub through recordings, correlate events, and form a hypothesis. That’s where Rollbar’s MCP server, paired with an AI agent like Github Copilot, changes the game.

Elephant in the Room, Episode 2: Working with Feature Flags, Read Replicas, and Postgres Analytics

Welcome to “Elephant in the Room – Presented by Aiven.” In this live series, we dig into the real-world challenges developers face when building modern applications on PostgreSQL, the elephant everyone depends on but few fully leverage. Episode 2: Join Jay Miller, Staff Product Advocate at Aiven, and special guest Alexis Roberson – senior developer educator, and feature flag enthusiast – as they explore how developers can use feature flags to ship faster, reduce risk, and create better user experiences with PostgreSQL-backed applications.

Using AI + Rollbar's Session Replay to Understand Complex Errors

Front‑end bugs are notoriously hard to reproduce. By the time an error shows up in your monitoring tool, the most important context is already gone: *what the user actually did*. By letting an AI agent like Copilot analyze Rollbar's session replay data directly, teams can move from *“something broke”* to *“here’s exactly why it broke”* in minutes, not hours.

How to use AI to analyze and visualize CAN data with Grafana Assistant

Note: A version of this post originally appeared on the CSS Electronics blog. Martin Falch, co-owner and head of sales and marketing at CSS Electronics, is an expert on CAN bus data. Martin works closely with end users, typically OEM engineers, across diverse industries, including automotive, maritime, and industrial. He is passionate about data visualization and AI—and he’s been working extensively with Grafana Assistant.

JWT Rot: Why Traffic Replay Tests Expire #speedscale #jwt #trafficreplay #apitesting #testautomation

Are your traffic replay tests crumbling because of expired tokens? You've got JWT Rot! When recording production traffic for integration or load testing, the embedded JSON Web Tokens (JWTs) often have a short expiration date. Once those tokens expire, your entire test suite fails, rendering your valuable traffic snapshots useless. Stop wasting time re-recording traffic. Learn how to defeat JWT Rot and ensure your security and API tests run reliably every time!

AI Reliability, Part 2: When the Datacenter Becomes the Bottleneck

In Part 1, we talked about all the hidden complexity inside AI systems: the pipelines, GPUs, embeddings, vector databases, orchestration layers, and everything else that quietly determines how reliable an AI-first product really is. But all of that software still rests on something far less glamorous: the physical infrastructure underneath it.

Why AI agents need a common data model #ai #telemetry

Clint Sharp explains why a common model like OCSF is critical for the future of AI. Agents need standardized data to analyze information effectively on your behalf. He contrasts the traditional manual workflow of checking Slack, tickets, and wikis while asking colleagues with a future where AI fuses this human context with machine data. Instead of just search results, AI agents will hand you examined hypotheses so you know exactly where to take your investigation.

How AI Agents automate incident response #ai #cybersecurity #telemetry

Clint Sharp demonstrates how Cribl Search leverages AI to streamline incident investigation. Starting from a Slack channel, the AI builds an interactive notebook, analyzes order processing logs, and identifies suspicious traffic spikes. It connects high CPU usage to a recent Jenkins deployment, hypothesizing a supply chain attack, and ultimately recommends a rollback. This isn't a far off concept. It is the future of operations arriving right now.