The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Case Study: Building an Operations Dashboard

Picture a simple e-commerce platform made up of several components, each generating logs and metrics. Now imagine the on-call engineer responsible for this platform, feet up on a Sunday morning, watching The Lord of the Rings with a coffee, when suddenly the on-call phone starts to ring. It’s a customer, reporting that sometimes, maybe a tenth of the time, the web front end returns a generic error as they try to complete a workflow.

Multi-Cloud - Rise of Hybrid Networks and the Need to Monitor & Secure Them

The multi-cloud, hybrid model has benefits, but it also introduces complexity for the IT teams tasked with monitoring and securing IT systems. The network monitoring technologies that system admins use with on-premises infrastructure typically cannot be extended to cover infrastructure and services running on public cloud platforms. This is a problem, because you cannot manage or secure what you cannot see.

What is Apdex Score? Why is it Important?

In today's fast-paced and rapidly evolving business landscape, it's more important than ever to keep track of how well your software applications are performing. That's where the Apdex score comes in. As a metric for measuring the user experience of an application, the Apdex score provides valuable insight into how your software is performing and how it can be improved.
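As a rough illustration of how the score is computed, here is a minimal sketch of the standard Apdex formula: requests at or under a target threshold T count as satisfied, requests between T and 4T count as tolerating (at half weight), and anything slower counts as frustrated. The function name, the sample response times, and the 0.5-second threshold are assumptions made for this example, not taken from the article.

```python
def apdex(response_times, threshold):
    """Return the Apdex score for a list of response times (in seconds).

    Apdex = (satisfied + tolerating / 2) / total samples, where
    satisfied means t <= T and tolerating means T < t <= 4T.
    """
    if not response_times:
        raise ValueError("need at least one sample")
    satisfied = sum(1 for t in response_times if t <= threshold)
    tolerating = sum(1 for t in response_times if threshold < t <= 4 * threshold)
    return (satisfied + tolerating / 2) / len(response_times)


# Hypothetical sample: 3 satisfied, 2 tolerating, 1 frustrated request.
samples = [0.2, 0.4, 0.5, 1.2, 1.9, 2.5]
score = apdex(samples, threshold=0.5)
print(f"Apdex: {score:.2f}")
```

The score always falls between 0 (every request frustrated) and 1 (every request satisfied), which is what makes it easy to compare across applications.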

Throw custom exceptions in Logic Apps: Using an API Management (Part V)

Welcome to the fifth and last part of this series of blog posts on how to throw custom exceptions inside Logic Apps. The last approach we want to address in this series is another out-of-the-box idea: using an API exposed in API Management to throw back the exception. This approach is similar to the previous one.

Page Speed Monitoring Will Elevate Your Website's Performance

In a world of constant connectivity, speed is vital. Imagine a user reaching your website only to be met with a stark, blank page. Their anticipation hangs in the balance as they await any sign of engagement. Such an encounter does little to endorse the readiness or accessibility of your business. In today’s hyper-connected world, every single millisecond matters.

Multi-Cloud Made Simple: Announcing Kentik Observability Enhancements for AWS and Google Cloud

Limited visibility into network performance across multi-clouds frustrates even the best teams. That’s why we’re thrilled to announce enhanced AWS and GCP support for Kentik Cloud, enabling network, cloud, and infrastructure teams to rapidly troubleshoot and understand multi-cloud traffic.

Alert Tuning Recommendations: Reinventing Anomaly Alerts with Anodot

In the complex and dynamic realm of data analytics, real-time anomalies offer insight into the issues a business faces. One enduring challenge persists: accurately distinguishing anomalies of significant importance from those of lesser consequence. This distinction is nontrivial, as not all anomalies carry the same weight.

May 2023: Monitor Your Domain Expiration Feature

Remember when we promised you some exciting news in the UptimeRobot Discord server blog? The day has finally arrived! We’re happy to introduce our latest feature: domain expiration monitoring! Expired domains can make your website totally inaccessible and cause damage to your brand and business. Fixing an expired domain can take days, and in the worst case you could lose the domain name entirely because someone else may register it first.