
Highlights from AWS re:Invent 2025: Making sense of applied AI, trust, and going faster

After four days at AWS re:Invent, a 65,000-step marathon with 60,000 attendees spread across five Las Vegas campuses and the latest installment of this 13-year-old cloud pilgrimage, we’re all a little dehydrated but significantly wiser. The volume of announcements felt less like a single flood and more like a river branching into three powerful currents. Making sense of this massive technological convergence requires zooming out.

The War Room of AI Agents: Why the Future of AI SRE is Multi-Agent Orchestration

We’ve all been there. It’s 2 AM, your phone is buzzing with alerts, and you’re suddenly thrust into an incident war room with a dozen other bleary-eyed engineers. The production environment is on fire, customers are affected, and everyone’s trying to piece together what went wrong. But here’s what makes these moments fascinating from a systems perspective – it’s rarely just one person silently fixing the issue in isolation.

How to Build a Clear AI Implementation Strategy

Organizations see AI’s transformative potential, but success requires more than technology – it demands a clear strategy led by IT. A structured AI implementation roadmap aligns initiatives with business goals, establishes governance, and enables measurable ROI, while improving employee and customer experiences. Yet while 66% of organizations view AI as critical, only 38% report a meaningful competitive advantage, which highlights the need for disciplined adoption.

Capture and Use Network Response Data in AI Powered Testing

Learn how to capture and use response data from network calls to build smarter and more reliable AI-driven tests. This walkthrough covers the full workflow from configuring user actions to extracting backend responses, validating data, and creating dynamic test flows. You will also see how response data improves debugging visibility and supports data-driven automation. The video is ideal for developers, testers, and platform engineers looking to improve the accuracy and resilience of AI-powered test suites.

Introducing Workspace: Where DEX Work Happens

Today marks another milestone for Nexthink as we introduce a powerful evolution of our platform, one that will meaningfully expand how customers derive value and empower many more teams across IT, HR, and the business to use Infinity. Welcome to Workspace: a new destination where the future of DEX and IT work comes together.

Runtime Context for AI Agents with Lightrun MCP

Introducing Runtime Context for AI agents: the next evolution in autonomous software development. The Lightrun MCP connects IDEs and AI assistants to real runtime data, giving agents and developers the context they need to write, validate, and debug code with confidence. Reliable, AI-accelerated engineering starts here.

Agentic AI by Design: Evolving Our Principles for the Next Chapter of Responsible AI

Join SolarWinds CISO Tim Brown and CTO Sai Krishna for the SolarWinds Day Closing Keynote, where they share how SolarWinds is evolving from Secure by Design to AI by Design—a bold next step in building trusted, intelligent, and future-ready IT operations. As organizations adopt AI-driven systems, embedding trust, transparency, and accountability into product development becomes essential. In this forward-looking discussion, Tim and Sai reveal how the AI by Design framework ensures responsible AI adoption while enhancing performance, reliability, and security.

Datadog at AWS re:Invent, Bits AI SRE, MCP Server, CloudPrem, and more | This Month in Datadog

Get a closer look at features we announced at AWS re:Invent in the latest episode of This Month in Datadog. Tune in for spotlights of Bits AI SRE, now generally available, and Datadog’s MCP Server, which connects AI agents to our platform by ingesting prompts and mapping them to Datadog resources and data. This Month in Datadog brings you the latest updates on our newest product features, announcements, resources, and events.

Sage AI: Dashboard, events, knowledge base

It's starting to take shape. We have a dashboard, we're collecting some metrics, and I'm getting a daily briefing every morning. I also have an event log that receives every event (the spine of the system), and a knowledge base built from a GitHub repository that is vectorized and indexed. Its first use is adding context to Herald, the agent that sends me the morning briefing. More details to come.