The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

The Starlink Outage and Its Impact on Community Gateways

Last month, Starlink suffered its largest outage in years, arguably its biggest since becoming a major internet provider. In addition to the millions of individual customers around the world, the outage disconnected the Community Gateways, customers of Starlink’s new transit service. In this post, we delve into the outage and its impact on these far-flung networks.

How to Effectively Monitor Kubernetes in 2025

As Kubernetes environments continue to grow in scale and complexity, a robust monitoring strategy is no longer just good practice; it’s essential for survival. For engineering teams in 2025, effective monitoring and observability are the bedrock of performance, reliability, and cost control. This guide dives into the critical aspects of modern Kubernetes monitoring, from key metrics to the top tools and frameworks and the rising role of AI in managing these complex systems.

Building a K12 IT Command Center: Monitor All Your Educational Services

Managing technology in K-12 schools has become increasingly complex. With dozens of educational platforms, administrative systems, and communication tools running simultaneously, IT teams need a comprehensive K-12 IT monitoring dashboard to maintain visibility across their entire technology ecosystem.

Taming Alert Chaos: Modern Incident Alert Management Strategies

Every IT team knows the feeling: your phone buzzes at 3 AM with yet another alert. Is it critical? Can it wait until morning? With dozens of monitoring tools and hundreds of potential failure points, incident alert management has become one of the most challenging aspects of maintaining reliable systems.

Announcing the Winner of the 2025 StatusGator Women in Tech Scholarship: Lara Djukic

Earlier this year, we launched the StatusGator Women in Tech Scholarship to support and empower women pursuing careers in technology. We are thrilled to announce that our 2025 scholarship recipient is Lara Djukic, an inspiring young technologist whose vision blends innovation with a deep commitment to her community. Through the Bold.org scholarship platform, we’ve awarded Lara a $3,100 scholarship.

How to Monitor Multiple School Platforms: Google Workspace, Canvas, and PowerSchool from One Dashboard

Managing technology in K-12 schools means juggling dozens of critical platforms simultaneously. When Google Workspace goes down during morning classes, Canvas experiences issues during exam submissions, or PowerSchool becomes unavailable during grade entry periods, the impact ripples through entire school communities. The ability to monitor multiple school platforms from a centralized dashboard has become essential for educational IT teams.

How Elastic Powers Search in Real-Time (Explained in 52 Seconds)

Ever wondered how Wikipedia loads answers instantly? Or how your Uber updates in real time? That’s Elasticsearch working behind the scenes. In this video, I break down how Elastic powers lightning-fast, scalable search for complex data, from ride requests to stock prices.

How to Adjust Semantic and Lexical Search Weights in Elasticsearch

In this session, we’ll show you how hybrid search using Elastic lets you assign weights to different search types, for example giving semantic search three times more influence than lexical search. This lets you fine-tune the balance between precise keyword matching and broader, context-aware results.
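
To make the 3:1 weighting concrete, here is a minimal, library-free sketch of a weighted score combination. It is an illustration only and is not Elastic's API or the method shown in the session: the function names `min_max` and `hybrid_rank`, the min-max normalization step, and the sample scores are all assumptions made for this example.

```python
def min_max(scores):
    """Scale raw scores to [0, 1] so the two result sets are comparable."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All scores are identical; treat every document as a full match.
        return {doc: 1.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def hybrid_rank(semantic, lexical, semantic_weight=3.0, lexical_weight=1.0):
    """Combine normalized scores linearly; a document missing from one list scores 0 there."""
    sem, lex = min_max(semantic), min_max(lexical)
    combined = {
        doc: semantic_weight * sem.get(doc, 0.0) + lexical_weight * lex.get(doc, 0.0)
        for doc in sem.keys() | lex.keys()
    }
    return sorted(combined.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical scores: semantic similarity vs. a keyword (BM25-style) score.
semantic = {"a": 0.9, "b": 0.5, "c": 0.1}
lexical = {"b": 12.0, "c": 7.0, "d": 2.0}

ranking = hybrid_rank(semantic, lexical)
for doc, score in ranking:
    print(doc, round(score, 3))
```

With these sample inputs, document "a" ranks first on its semantic score alone even though it has no keyword match, which is the kind of shift a heavier semantic weight is meant to produce. Lowering `semantic_weight` moves the ranking back toward exact keyword matches.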

Visualize Logs Alongside Metrics: A Complete Guide for Monitoring Slow MySQL Queries

When a service slows down, metrics tell you that it’s happening, but logs tell you why. For MySQL, slow queries can be a silent performance killer, gradually chewing through resources until users start complaining. By enabling MySQL’s slow query log and forwarding it to Loki (via Promtail), you can visualize query-level details right alongside your metrics on Grafana dashboards. This makes it easy to correlate what is slow (metrics) with what is causing the slowdown (logs).