
The latest news and information on AIOps, alerting in complex systems, and related technologies.

Resolve's Agents of IT podcast - S2Ep5 - Ari's Hot Takes

In this episode of Agents of IT, Ari Stowe and Ian Coppock unpack the recent Claude outage and what it reveals about our growing dependence on AI at work. From developers suddenly returning to Stack Overflow to the infrastructure challenges behind AI scaling, the conversation explores what happens when AI becomes critical enterprise infrastructure. They also discuss how organizations should prepare for AI outages, why “stampede adoption” is the new reality of AI releases, and what resilient, multi-agent architectures could look like going forward.

Bring Clarity and Confidence Back to Ops: How Trustworthy Guidance Sets a New Standard

For years, enterprises have chased the promise of artificial intelligence as a remedy for growing operational complexity. It seemed logical that if environments were expanding faster than teams could keep up, smarter models could fill the gap. But early deployments of generic AI proved a difficult truth. Intelligence alone does not create operational clarity. It does not guarantee safety.

Episode 6 - The evolution from automation to autonomy

Tom and Akhilesh unpack why automation alone will never deliver autonomy, and why intelligence means anticipating change rather than constantly reacting to it. They explore the role of people in enterprise transformation, the limits of technology without trust and context, and why the most powerful use of AI is freeing humans to focus on what they do best. Plus, Akhilesh makes the case for ping pong as a surprisingly effective way to reset when the pressure is on.

Full-Stack Observability Is Becoming a Business Imperative

As enterprises accelerate digital transformation, technology performance has become inseparable from business performance. Customer experiences, revenue streams, and operational efficiency increasingly depend on the reliability of complex, distributed systems. In this environment, full-stack observability is no longer a technical aspiration — it is a strategic necessity.

The Speed of Clarity: How Grounded Context Transforms Triage and Strengthens Operational Decision-Making

Modern operations move at a pace that leaves little room for ambiguity. When an incident emerges, teams must determine what is happening and how best to respond. Yet triage often slows under the weight of fragmented data, noisy alerts, and limited shared understanding across engineering groups. These conditions stretch routine issues into drawn-out investigations and delay action exactly when teams need to move with purpose.

Enabling Proactive ITOps with Skylar Advisor

By continuously connecting signals across your IT environment, Skylar Advisor turns operational complexity into clear, prioritized guidance. It highlights potential impact, explains why it matters, and delivers clear next steps so IT teams can act early and stay ahead of alerts before they turn into issues.
Sponsored Post

Fabrix.ai at Cisco Live 2026 Amsterdam

This post highlights the biggest Cisco AI Summit takeaways that came up again and again in Cisco Live conversations, and what they mean for teams operating AI in production. If you are following the broader AgentOps movement and the rise of agentic workflows, Fabrix.ai’s point of view is grounded in a core idea: AI agents create value only when they can be operated safely and consistently. A good starting point is here: Fabrix.ai’s approach to agentic.