Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Don't Let Downtime Define You: 10 Status Page Templates [2025]

In today's always-on world, your website or application is the lifeblood of your business. Downtime isn't just an inconvenience; it's a threat to your reputation, customer loyalty, and bottom line. As we highlighted in our recent article on MTTR, quickly resolving incidents is crucial. But equally important is how you communicate those incidents to your users. That's where status page templates come in.

From failure to fix: Diagnose Kubernetes Node and Pod problems with Site24x7

Picture a busy Monday morning. You are working on leftover projects from the previous week, and assuming everything is fine with your applications as you had not received support tickets during the weekend. All of a sudden, during the middle of the day, you get a flood of reports from users who complain about slow response in your application and error pages piling up. You and your team are scrambling hard to figure out the issue.

How to get started with error budgets to meet SLOs for improved service reliability

As modern IT systems grow in complexity, IT operations teams have to work harder to ensure reliability. "What gets measured gets managed" is a management mantra that emphasizes the role of metrics in management. To ensure everything works well, operations teams need service-level objectives (SLOs). This industry term measures how an application meets the agreed-upon quality and reliability standards, serving as a bellwether of good software.

Starlink Enters Transit Market With Community Gateways

Starlink moves beyond being strictly a direct-to-consumer service provider with the recent activations of its Community Gateways. In recent months, Starlink has become a transit provider to a small but growing number of service providers in remote parts of the world as its unique and groundbreaking service continues to evolve.

How JavaScript Execution Can Cause Browser Performance Issues #coding #chromedevtools #programming

Decode website loading sequences with Todd Gardner's essential guide to waterfall charts in this Concepts of Web Performance tutorial. Perfect for entry-level web developers struggling with slow websites, this video demystifies those intimidating colored bars you've seen in Chrome DevTools, WebPageTest, and monitoring tools like Request Metrics. Learn to interpret the crucial elements of waterfall charts—from request queuing and waiting times to content downloading phases—all visualized on a timeline measured in milliseconds. Discover how to identify two major performance bottlenecks.

How to use data source variables in Grafana dashboards

Data source variables let you change where Grafana looks for data without having to create duplicate dashboards. So for example, if you have multiple different Prometheus databases, you can have one dashboard and use a data source variable to choose which Prometheus that dashboard uses. We'll look at how to set these up in this video. Grafana Cloud is the easiest way to get started with Grafana dashboards, metrics, logs, and traces. Our forever-free tier includes access to 10k metrics, 50GB logs, 50GB traces and more. We also have plans for every use case.

The Future of Dynamic Observability with Sumo Logic -- Customer Brown Bag -- March 27th, 2025

Join us as Sr. Dir. Technical Marketer, Adam White, and Sr. Product Marketing Manager, Hadijah Creary, go beyond the usual technical deep dive—focusing on the mindset, industry trends, and thought leadership shaping modern observability and the future of dynamic observability with Sumo Logic.

Everything you need to know about HAProxy log format

HAProxy is one of today’s fastest and most widely used load balancing solutions. If you’re already using HAProxy or considering using it in your environment, understanding HAProxy logging is essential. Let’s discuss why HAProxy logging is vital to the load balancer implementation, the logging HAProxy offers, and how to manage and configure HAProxy logs to suit your unique needs.
Sponsored Post

Monitoring for operations of SAP S/4HANA Cloud, public edition

"Do I need to monitor SAP S/4HANA Cloud, public edition?" is the question many SAP customers are asking right now as projects are going live. As an SaaS product run by SAP, customers get access only through a public website, and SAP are responsible for the availability of that website and the hardware resources. The places where traditional monitoring focussed either aren't relevant, aren't visible, or superficially aren't the customer's problem anymore. Does that mean there is no need to monitor anything in SAP S/4HANA Cloud, public edition?