Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Robust Time Series Monitoring: Anomaly Detection Using Matrix Profile and Prophet

Monitoring production systems often feels like searching for a moving needle in a constantly shifting haystack. At Sentry, our goal was to empower customers to move beyond traditional threshold and percentage-based alerting. We aimed to help them detect subtle and complex anomalies in their systems in near real-time. This post will detail how our AI/ML team developed a time series anomaly detection system using Matrix Profile and Meta’s Prophet.

Dynamic Status Pages on Demand

Clients expect transparency - especially when things go wrong. But manually updating a status page during an incident or maintenance window slows you down when speed matters most. Oh Dear’s status pages are more than just a pretty uptime dashboard. They’re fully API-driven and designed to scale with your workflow. Whether you manage five client sites or five hundred, you can create, update and sync status pages as needed. Here’s how to do it.

Effortless customer monitoring with Site24x7's MSP Customer Health View

As a Managed Service Provider, staying on top of your customers’ monitor statuses shouldn't be a hassle. With Site24x7's Customer Health View, you get a centralized, real-time summary of every customer account you manage. Access monitor statuses, alarm counts, and overall account health—all in one place. Switch between List View and Grid View, apply filters to prioritize issues, and let auto-refresh keep you up to date every five minutes.

Debug smarter with Session Replay in Site24x7 real user monitoring (RUM)

Frontend errors can be tricky to trace without context. Site24x7's Session Replay gives developers, SREs, and DevOps teams complete visibility into the user journey by capturing every click, scroll, and interaction as it happened. With visual replays and correlated performance data, you can quickly identify what went wrong, why it happened, and how to fix it—without relying on user screenshots or log reports.

Netdata: The Fastest Path to Full Stack Observability. AI Powered.

Netdata is a real-time, high-performance and on-premises observability platform designed to monitor metrics and logs with unparalleled efficiency. Netdata requires zero-configuration to get started, and provides alerts, anomaly detection and AI assisted troubleshooting out of the box, providing a powerful and comprehensive infrastructure monitoring experience. Netdata is known for its distributed design. Instead of funneling all data into a few central databases like most traditional monitoring solutions, Netdata processes data at the edge, keeping it close to the source.

Introducing Netdata Insights

Subscribe to the channel → / @netdata Now in research preview: Netdata Insights The problem: Incident? You're jumping between dashboards, piecing together timelines. Reporting? You're copy-pasting charts and correlating trends by hand. The data’s there, but turning it into a narrative doesn’t scale. The solution: Netdata Insights. Synthesizes high-fidelity telemetry using the latest LLMs into AI-powered reports with natural-language explanations, visuals, and clear recommendations.

The Complete Guide to APM Best Practices for Developers, DevOps & SREs

Application Performance Monitoring (APM) is no longer optional, it is essential for delivering fast, reliable, and seamless digital experiences. But simply installing an APM tool isn’t enough. To truly know its potential, IT teams need to follow APM best practices. Best practices for APM refer to the most effective ways to monitor, analyze, and optimize your application’s performance using APM tools.

A little love for two old fellas - Icinga Business Process Modeling and Icinga Web Graphite Integration

Today is the day, we grant two products their long overdue maintenance. Maintenance always sounds boring, I hear you. But let me remind you that this also means we do and take care! And what this actually is all about: Now let’s see what each release offers!

Close the gaps in your SCOM monitoring with the Opslogix Autonomous Windows Service Management Pack

Close the gaps in your SCOM monitoring with the Opslogix Autonomous Windows Service Management Pack SCOM offers strong monitoring capabilities, which is extended through its various Management Packs. However, a common challenge is that some Windows services goes unmonitored, simply because they don’t belong to a specific Microsoft technology like SQL Server or IIS.