Operations | Monitoring | ITSM | DevOps | Cloud

Elephant Flows: The Hidden Heavyweights of AI Data Center Networks

Elephant flows are no longer rare. They’re foundational to AI workloads. In today’s GPU-heavy data centers, long-lived, high-volume flows can distort ECMP, overflow buffers, and rack up unexpected cloud bills. Kentik helps you see and tame these elephants with real-time flow analytics, automated alerting, and predictive capacity planning.

Can Claude Code Observe Its Own Code?

One of the great things about OpenTelemetry is that it’s a standard, and standards tend to proliferate. I was excited to see Claude Code add OpenTelemetry metric and log support in a recent release. What was really interesting—beyond the ability to capture usage data from Claude Code—is that you can also get pretty detailed logs about what you’re doing with Claude Code.
Sponsored Post

Almaden CEO Leandro Silva Joins Key Discussion on the Digital Future of Business and the Role of AI

On the morning of Thursday, June 12, the São Paulo office of L.O. Baptista Advogados hosted a high-level event titled "Innovation and AI: The Digital Future of Business." The gathering brought together a diverse and engaged audience of legal and tech professionals to discuss how artificial intelligence is reshaping strategic decisions and transforming modern enterprises. Among the featured speakers was Leandro Silva, CEO of Almaden Inc., who joined a dynamic and interactive panel exploring the opportunities and challenges of using AI responsibly in corporate environments.

Going beyond AI chat response: How we're building an agentic system to drive Grafana

As we look at the role AI can play in Grafana going forward, we want to move beyond the simple chat responses that dominate the world of LLMs today and into agentic systems—AI that can understand, reason, and act on your behalf. The ultimate goal is to make it easy to get things done in Grafana using natural language—whether you’re a seasoned SRE or a new developer. And in the AI world, we call this moving from chat completion to task completion.

How Puppet is Redefining Infrastructure Management with AI, Powered by Perforce Intelligence

AI has emerged as a defining force in modern technology, spearheading transformation across industries. Yet, despite its promise to revolutionize workflows and unlock unprecedented efficiency, most DevOps organizations face significant hurdles in adopting AI safely and effectively. Concerns about complexity, scalability, and governance hold many decision makers back.

Demo Roundups! Meet the PagerDuty AI Agents

Welcome to the future of operations, where people and agents manage critical work together, driving productivity and efficiency. Learn how PagerDuty’s AI agents can supercharge teams, by autonomously handling repetitive tasks and resolving well-known issues, while surfacing data and insights that augment human expertise for faster resolution and higher operational resilience.

How we're shipping faster with Claude Code and Git Worktrees

Four months ago, Claude Code was announced and we were requesting invites to its "Research Preview." Now? We've gone from no Claude Code to simultaneously running four or five Claude agents, each working on different features in parallel. It sounds chaotic, but it's been a natural progression as we've learned to trust AI more and as the tools have dramatically improved.

AI Arms Race: How Artificial Intelligence is Both the Weapon and the Shield in Modern Cyber Warfare

Cybercriminals are no longer lone hackers sitting behind screens; they use artificial intelligence to supercharge their attacks. They employ large language models (LLMs) that generate phishing content to evade human detection. They use malware that adapts mid-execution to bypass sandbox environments and deepfake video/audio to mimic executives in real time.

Can AI/ML Guide Observability? Tech Talk #6

This talk will examine the application of Artificial Intelligence and Machine Learning in observability. It will cover how AI/ML is being used to monitor systems, detect anomalies, and extract insights from telemetry data. The session will provide information on integrating AI/ML into observability pipelines, improving analytical capabilities, and system performance.