Operations | Monitoring | ITSM | DevOps | Cloud

September product updates

September was a busy month at StatusGator! We rolled out several major updates designed to give you more visibility, better integrations, and deeper control of your monitoring workflows. From new Early Warning Signal integrations to AWS Health support — plus our biggest API release yet — here’s a quick recap of everything we shipped last month.

Why Citrix VAD/DaaS Customers Using VMware Should Consider Migrating to XenServer

For years, VMware vSphere was the undisputed leader in enterprise virtualization. Its reliability, feature set, and ecosystem made it the go-to hypervisor for organizations. Also for organizations running Citrix Virtual Apps and Desktops (VAD) or Citrix DaaS, VMware was synonymous with virtualization excellence. But the landscape has changed, dramatically. If you're still running your Citrix workloads on VMware, it's time to take a serious look at XenServer, and here's why.

Announcing Scout's MCP Server for AI-Native Monitoring!

We’re excited to introduce the Scout Monitoring MCP Server — a new way to bring AI-native monitoring directly into your coding assistant. Instead of flipping between dashboards and logs, the MCP (Model Context Protocol) server surfaces performance data, errors, and slow endpoints right where you work. Ask plain-language questions like “show me the latest five errors” and get answers grounded in live telemetry. You can even let your coding assistant propose and push fixes!

Debugging Microservices in Production with Distributed Tracing

Your production checkout flow just started returning 500 errors. Six microservices handle checkout. Logs show errors in three of them. Which service broke? Which error happened first? What caused the cascade? Traditional debugging doesn't work. You can't attach a debugger to production. Searching logs across six services gives thousands of lines with no obvious connection. By the time you correlate timestamps and trace IDs manually, customers have abandoned their carts.

Cloud Microservices Monitoring on AWS and Azure with OpenTelemetry

Your checkout flow starts in an AWS Lambda function, calls a payment service running on EKS, then triggers notifications through Azure Functions. Three different compute platforms, two cloud providers, one distributed trace that you can't see. Cloud providers want you to use their native monitoring tools. AWS pushes X-Ray and CloudWatch. Azure promotes Application Insights and Azure Monitor. These tools work well within their ecosystems but lock you into vendor-specific implementations.

OpsHelm goes multi-cloud with Aiven Diskless BYOC, cuts costs by 78% over MSK

In under a month, OpsHelm the continuous, enriched changelog for cloud infrastructure - migrated its streaming backbone from MSK and NATS to Aiven Diskless Kafka (BYOC on AWS). The switch eliminated cross-cloud networking fees, collapsed multiple storage layers into one, and cut total streaming costs by 5x (from >$50,000/year to <$10,000/year) while serving the team a single logical event bus that stretches across multiple regions and accounts.

Sending beers all across Belgium, a throwback to how we named Oh Dear

We're obviously a little biased, but we believe we have one of the best website monitoring tools on the market today, leading in features compared to our competitors. We've already tried a variety of marketing techniques to promote our service, but none really had the impact we were looking for. Maybe we're better at actually building good software than we are at marketing it? Or are we trying what everyone else is also doing, thus making it all harder?

3 things you can do to get closer to five nines

5 minutes. That’s how much downtime some of the world’s largest enterprises will tolerate. For most organizations, five nines (99.999%) of availability sounds like a pipedream. But the trick to increasing availability isn’t massive infrastructure spending or complex system redesigns. All it takes are three key practices that any team can adopt and implement. In this post, we’ll present these practices and how we implement them at Gremlin.

Agentic AIOps in Action: LogicMonitor, IBM, and Red Hat Deliver Self-Healing IT

Your most skilled engineers shouldn’t be spending nights and weekends piecing together root causes of outages. Yet many organizations still rely on manual incident response across sprawling hybrid and multi-cloud environments. The result: slower resolution times, frustrated customers and lost revenue that can reach up to $1 million per hour according to IDC. At LogicMonitor, we believe the answer isn’t just better monitoring. It is systems that can heal themselves.