The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Pandora FMS Stands Out in G2 Spring 2025 Reports: 35 Key Recognitions in Monitoring and Cybersecurity

Madrid, April 2025 – The monitoring and observability platform Pandora FMS has been recognized in 35 leading reports in the G2 Spring 2025 edition, solidifying its position as one of the most versatile solutions for managing complex IT infrastructures, hybrid environments, and critical operations.

How to keep Ingress NGINX Controller metric volumes manageable and still meaningful

The Ingress NGINX Controller is a widely used Kubernetes component for managing HTTP and HTTPS traffic routing. While it provides powerful observability through Prometheus metrics, it’s also notorious for generating an excessively high number of time series. The root cause lies in how the controller labels its metrics—tracking requests across multiple dimensions such as ingress name, host, path, status code, and upstream response times.
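To see why label dimensions inflate series counts, here is a rough sketch of the arithmetic: the worst-case number of time series for one metric is the product of the distinct values of each label. The label names and counts below are hypothetical and are not taken from any real cluster or from the controller's actual metric definitions.

```python
from math import prod

# Hypothetical number of distinct values seen for each label (illustrative only).
label_cardinalities = {
    "ingress": 50,
    "host": 50,
    "path": 20,
    "status": 10,
}


def worst_case_series(cardinalities, drop=()):
    """Upper bound on series for one metric: the product of the
    distinct-value counts of every label that is kept."""
    return prod(n for label, n in cardinalities.items() if label not in drop)


full = worst_case_series(label_cardinalities)
reduced = worst_case_series(label_cardinalities, drop={"path"})
print(f"all labels: {full} series, without 'path': {reduced} series")
```

Real counts are usually lower than this bound because not every label combination occurs, but the sketch shows why dropping a single high-cardinality label can shrink the total by an order of magnitude.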

Common Downtime Causes and How Website Monitoring Can Help

Downtime only shows up at the most inconvenient moments — like right after a 'quick deploy' or during the five minutes you dared to step away. Maybe it’s a traffic spike hammering one endpoint and taking the rest down with it. Maybe it’s that 'small change' you confidently shipped straight to prod. Either way, users can’t reach your site, and now you’re debugging live in production.

Australia Is Investing in Resilience - Are Businesses Ready?

The 2025-26 Australian Federal Budget sets out a clear priority: building a stronger economy and a more resilient nation. That includes investment in critical infrastructure, skills and services to help Australians navigate ongoing uncertainty. More than $3 billion has been committed to upgrade the National Broadband Network (NBN), extending high-speed fibre to 95% of homes and businesses.

How to get alerted when your EC2 instance shuts down

Some of your most critical infrastructure runs on AWS EC2, so it's pretty damn important to know when your EC2 instances shut down. Sure, chances are someone in your organisation will start kicking and screaming within 30 minutes of a particularly important instance shutting down, but we can do better than that. When it comes to monitoring and customers (whether inside your org or outside), being proactive wins you a lot of points.

A Guide to OpenTelemetry Tracing in Distributed Systems

Understanding what’s happening inside your applications is key to keeping them performing well and reliably. OpenTelemetry tracing is an open-source, flexible solution that lets you monitor your distributed systems without locking you into a specific vendor. This guide walks you through everything you need to know about OpenTelemetry tracing, from the basics to more advanced techniques, with practical tips for troubleshooting common issues along the way.

Apache Tomcat Performance Monitoring: Basics and Troubleshooting Tips

When Java web applications experience slowdowns or crashes, the culprit is often the Tomcat server. For DevOps engineers overseeing critical applications, proactive monitoring is crucial for ensuring optimal performance and reliability. In this guide, we'll explore the essential aspects of monitoring Apache Tomcat servers, focusing on the key metrics to track, setting up robust monitoring systems, and troubleshooting common performance issues that could impact your application’s stability.