
The latest news and information on AIOps, alerting in complex systems, and related technologies.

Why Alert Fatigue Is Killing Your MTTR

Every minute counts when production systems go down. Yet the average enterprise NOC team receives over 1,000 alerts per day, according to a 2025 study by OpsRamp. Of those, fewer than 5% require human intervention. The rest? They are noise — redundant, low-priority, or symptomatic signals that bury the genuine incidents demanding immediate attention.

Episode 10 - How I Learned to Stop Worrying and Love AI

Are we still in the first chapter of AI, and mistaking it for the whole story? In this episode of The Intelligent Enterprise, host Tom Stoneman zooms out from the headlines to explore where we really are in the AI journey. He’s joined by journalist and independent analyst Joe McKendrick, who has spent decades documenting how emerging technologies reshape business and society. As co-chair of the AI Summit in New York and a senior contributor to Forbes and ZDNet, Joe brings the perspective of someone who understands how these stories unfold over time.

The New Economics of Enterprise AI: Why Small Models Win Where It Matters

For years, progress in AI was equated with scale. Larger models, broader parameter counts, and increasingly complex cloud architectures were treated as signals of advancement. In enterprise operations, however, scale alone does not determine success. Economics does. As AI becomes embedded in operational workflows, organizations are discovering that model size is less important than cost stability under continuous load. AI-driven operations do not run in bursts. They run constantly.

Bridging IT and OT: Lessons from the Factory Floor with Steve Goudreau

Everyone’s rushing to AI, but few have the foundation to make it work. In this episode of Next Gen Network Heroes, Bob sits down with Steve Goudreau, Director of IT at Ice Industries, to explore what it really takes to lead in today’s evolving technology landscape. With over three decades of experience, spanning military service, financial services, and manufacturing, Steve brings a grounded, people-first perspective to an industry often obsessed with tools and trends.

Why Threshold Monitoring Fails in Distributed Systems

For years, infrastructure stability could be approximated through static limits. If CPU utilization exceeded a defined percentage or response time crossed a fixed boundary, risk was assumed to increase in a predictable way. Monitoring systems were designed around that assumption, and for contained environments, it largely held true.
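The static-limit approach described above can be sketched in a few lines. This is a minimal illustration, not any vendor's implementation: the `Sample` type, the function name, and both limit values are hypothetical, chosen only to show that each metric is judged against a fixed boundary with no awareness of other services or of normal variation.

```python
from dataclasses import dataclass

# Hypothetical limits for illustration; the article does not specify values.
CPU_LIMIT_PERCENT = 90.0
LATENCY_LIMIT_MS = 500.0


@dataclass
class Sample:
    """One monitoring reading from a single host (hypothetical shape)."""
    host: str
    cpu_percent: float
    response_time_ms: float


def static_threshold_alerts(sample: Sample) -> list[str]:
    """Return one alert message per fixed limit that the sample exceeds."""
    alerts = []
    if sample.cpu_percent > CPU_LIMIT_PERCENT:
        alerts.append(
            f"{sample.host}: CPU {sample.cpu_percent:.0f}% > {CPU_LIMIT_PERCENT:.0f}%"
        )
    if sample.response_time_ms > LATENCY_LIMIT_MS:
        alerts.append(
            f"{sample.host}: latency {sample.response_time_ms:.0f}ms > {LATENCY_LIMIT_MS:.0f}ms"
        )
    return alerts
```

Each reading is evaluated in isolation, which is workable for a contained environment but says nothing about how a slowdown in one service relates to readings from the services around it.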

Building the AI Stack for Modern Network Operations - Surya Nimmagadda

AI is rapidly transforming network operations, but what does it actually take to build an AI stack that works in production? In this session from AI for Network Leaders – Powered by Selector, Surya Nimmagadda breaks down how modern AI systems for network operations are designed, deployed, and used today. The session is designed for network engineers, architects, and operators looking to move beyond theory and understand how AI is being applied in real production environments.

Frontline Truths: 100+ Network War Stories on the Path to Autonomous Operations - Eric Chou

The path to intelligent network operations isn’t a straight line. In this session from AI for Network Leaders – Powered by Selector, Eric Chou shares hard-earned lessons from over 100 conversations with network engineers and operators navigating automation, complexity, and the shift toward AI-driven operations. The session is a practical field guide for teams looking to move from reactive firefighting to building an AI-ready network foundation.

You Don't Have an AIOps Problem-You Have a Data Opportunity - Michael Wynston

AI can’t fix bad data. In this session from AI for Network Leaders – Powered by Selector, Michael Wynston breaks down a critical truth: the success of AIOps depends on the quality, consistency, and trustworthiness of your network data. Using real-world lessons from Fiserv’s large-scale network transformation, he explores how teams can build a strong data foundation that enables AI to deliver meaningful, low-noise outcomes.

Inside the AI Agents Transforming Network Operations - Joby Rudolph & James Schnebly | Selector

AI agents are becoming a core part of modern network operations, but what does it actually take to build and deploy them effectively? In this session from AI for Network Leaders – Powered by Selector, Joby Rudolph and James Schnebly break down how AI agents are designed, implemented, and applied in real-world network environments. The session provides a practical look at how AI agents are moving from concept to production, and what it takes to make them work at scale.