The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Nonsense Networking: Tech Talk #8

Ever feel like getting simple data from your network is way harder than it should be? You're not alone. With so many devices, the amount of data can be overwhelming, making it tough to see what's actually happening. In this stream, we're breaking down the common frustrations with network monitoring. We'll cover:

The SNMP Problem: We'll start with why the "standard" method, SNMP, is often a pain. We'll look at the challenge of finding the right MIBs and OIDs just to get tools like Telegraf or Prometheus to work.

(ServiceNow + Kentik) From Reactive to Proactive: The Rise of Agentic Networks

Agentic AI is not just hype; it’s a force multiplier that enables infrastructure and operations teams to do more, with less effort, in less time. Importantly, it helps IT teams compress time to resolution and even detect and respond to issues proactively, before they escalate.

How to Monitor OTP-Protected Web Applications

If you’ve ever used an online banking application to complete a transaction or gone through a checkout on an e-commerce platform, chances are you’ve interacted with an OTP-protected application. One-Time Passwords (OTPs) are at the center of most multi-factor authentication (MFA) systems. OTPs are temporary codes delivered through channels such as SMS, email, authenticator apps, and push notifications.
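
As a rough illustration of what a synthetic monitor needs in order to log in past an OTP prompt, the sketch below generates a time-based OTP (TOTP, as standardized in RFC 6238) of the kind authenticator apps produce. It assumes the application uses TOTP with HMAC-SHA1 and a shared secret available to the monitor; the teaser does not say which OTP scheme the article covers, and the function name `totp` is purely illustrative.

```python
import hashlib
import hmac
import struct
import time
from typing import Optional


def totp(secret: bytes, at_time: Optional[float] = None,
         step: int = 30, digits: int = 6) -> str:
    """Return the RFC 6238 TOTP code (HMAC-SHA1) for the given moment.

    `secret` is the raw shared key (assumption: already base32-decoded).
    """
    if at_time is None:
        at_time = time.time()
    # Number of `step`-second intervals since the Unix epoch.
    counter = int(at_time) // step
    digest = hmac.new(secret, struct.pack(">Q", counter), hashlib.sha1).digest()
    # Dynamic truncation (RFC 4226): the low nibble of the last byte picks an offset.
    offset = digest[-1] & 0x0F
    value = struct.unpack(">I", digest[offset:offset + 4])[0] & 0x7FFFFFFF
    return str(value % 10 ** digits).zfill(digits)
```

A scripted check could call `totp(secret)` just before submitting the login form; codes delivered by SMS or email would instead require access to the receiving inbox or number.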

95% of AI Pilots Fail - Here's How to Be the 5%

When MIT released research showing that 95% of enterprise AI pilots fail to deliver measurable business impact, it made headlines for a reason. After years of heavy investment in artificial intelligence, the vast majority of organizations still haven’t moved beyond pilots that promise much but deliver little. This doesn’t mean AI itself is broken. In most cases, the technology performs as intended.

Why Do SSL Certificates Fail in Multi-Cloud Environments (AWS, Azure, GCP)?

SSL certificates keep websites and apps secure, but in AWS, Azure, and Google Cloud Platform (GCP), misconfigurations or expirations can still cause services to go offline. Why do these failures happen, and how can you prevent them?
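
One way to catch expirations before they take a service offline is to check each endpoint's certificate on a schedule, regardless of which cloud hosts it. The following is a minimal sketch using only Python's standard library; the function names and the idea of alerting on a day threshold are assumptions for illustration, not something taken from the article.

```python
import socket
import ssl
import time
from typing import Optional


def fetch_not_after(host: str, port: int = 443, timeout: float = 5.0) -> str:
    """Connect over TLS and return the certificate's notAfter string."""
    context = ssl.create_default_context()
    with socket.create_connection((host, port), timeout=timeout) as sock:
        with context.wrap_socket(sock, server_hostname=host) as tls:
            return tls.getpeercert()["notAfter"]


def days_until_expiry(not_after: str, now: Optional[float] = None) -> float:
    """Convert a notAfter string (e.g. 'Jan  5 09:34:43 2018 GMT') to days left."""
    expires = ssl.cert_time_to_seconds(not_after)
    if now is None:
        now = time.time()
    # Negative values mean the certificate has already expired.
    return (expires - now) / 86400
```

For example, `days_until_expiry(fetch_not_after("example.com"))` could be compared against a threshold such as 14 days, with `example.com` standing in for a real endpoint.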

A Practical Guide to Python Application Performance Monitoring (APM)

When your Python app starts slowing down (maybe queries are taking longer, memory keeps creeping up, or API calls are lagging), basic server metrics won’t tell you why. You need to see what’s happening inside the application itself. That’s the role of Application Performance Monitoring (APM). It gives you a breakdown of database queries, external API calls, memory usage, error rates, and more, so you can connect the dots between code and performance.
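
To make the idea of in-application visibility concrete, here is a toy sketch of the kind of per-operation timing and error counting an APM agent automates. It is not any vendor's API: the `traced` decorator and the in-memory `stats` dictionary are hypothetical names invented for this example.

```python
import functools
import time
from collections import defaultdict

# In-memory metrics store (assumption: a real agent would export these instead).
stats = defaultdict(lambda: {"calls": 0, "errors": 0, "total_seconds": 0.0})


def traced(name):
    """Record call count, error count, and cumulative duration under `name`."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            start = time.perf_counter()
            try:
                return func(*args, **kwargs)
            except Exception:
                stats[name]["errors"] += 1
                raise
            finally:
                # Runs on both success and failure, so every call is counted.
                stats[name]["calls"] += 1
                stats[name]["total_seconds"] += time.perf_counter() - start
        return wrapper
    return decorator
```

Decorating a database helper with `@traced("db.query")` would then let you read the average latency as `total_seconds / calls` and the error rate as `errors / calls`.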

Serverless Applications: Why Monitoring is Essential for Speed and Reliability

Serverless applications are becoming the go-to architecture for modern developers. Startups and enterprises alike are building them because they offer scalability, cost-efficiency, and flexibility. However, these advantages come with unique challenges, especially when it comes to monitoring. Traditional server monitoring tools fail to capture short-lived functions, which makes dedicated serverless monitoring essential for maintaining performance and reliability.