Operations | Monitoring | ITSM | DevOps | Cloud

Powering Mexico's Digital Future: Expanded Internet Observability with Catchpoint

As of 2025, more than 110 million Mexicans are online, putting digital‐access penetration at roughly 83% of the population. Mexico is already one of Latin America’s anchor markets, leading the region in startup momentum, cloud adoption, and cross-border digital trade. A few days ago, CloudHQ announced a $4.6B investment in Mexico to open multiple datacenters. Yet even with this scale, service quality still varies dramatically across cities, states, and ISPs.

Baking in site reliability with observability and AI: How SpotOn uses Grafana Assistant to keep restaurants running

When you operate a restaurant, the last thing you want to do is shut your doors and turn away guests and staff because of some technology failure. And if you’re the one providing that tech, it’s your job to make sure that doesn’t happen. “For us, observability is about a lot more than just dashboards and alerts.

How NRP Scales Global Scientific Research with Calico

The National Research Platform (NRP) operates a globally distributed, high-performance computing and networking environment, with an average of 15,000 pods across 450 nodes supporting more than 3,000 scientific project namespaces. With its head node in San Diego, NRP connects research institutions and data centers worldwide via links ranging from 10 to 400 Gbps, serving more than 5,000 users in 70+ locations.

Major Retailer Accelerates Windows 11 Migration with Collective IQ

Migrating to a new operating system can be compared to renovating a house: you may be able to postpone it for a while, but eventually, you’ll have to face the project — and the relief only comes once it’s complete. While a home renovation can be planned with some flexibility, a Windows 11 migration has a fixed deadline and carries high risks. Every detail matters: compatible hardware, available disk space, drivers, and critical software that all need to align.

How DreamHost Slashed Memory Usage by 80% and Scaled to 76 Million Time Series

For any growing business, there comes a point where the tools that once worked perfectly begin to show their limits. This is especially true for monitoring infrastructure. As your user base, services, and data volumes expand, the pressure on your monitoring stack intensifies. For web hosting leader DreamHost, with over 1.5 million websites to manage, their existing open-source solutions simply couldn’t keep up.

How Nexus BMS Uses Time Series and AI to Power Smarter Buildings

Monitoring equipment isn’t enough for today’s smart buildings; true value comes from being able to predict issues, optimize performance, and take action automatically. Traditional building management systems often fall short, limited to dashboards and alarms that only notify you of an issue after the fact. With the rise of open source hardware, modern databases, and AI-driven diagnostics, facilities can now move from reactive to proactive management.

Zooplus Found Faster Root Cause Detection with Elastic Observability

Zooplus Platform Engineering Lead Aram Hakobayan shares how Elastic Observability helps manage 3,000+ microservices and 15,000+ logs/sec across their AWS cloud. Learn how Elastic powers their French market, centralizes monitoring, simplifies root cause analysis, and avoids costly vendor migration. Ideal for DevOps, SREs, and cloud architects scaling fast.

How Tipalti mastered Elasticsearch performance with AutoOps

From manual monitoring to proactive optimization, learn how Tipalti used AutoOps to save 10% annual costs. For a global payables automation leader like Tipalti, where financial transactions are the lifeblood of the business, infrastructure performance isn't just a technical goal; it's a core business requirement. Managing a complex ecosystem of databases, including Postgres, SQL Server, MongoDB, Kafka, and Elasticsearch, with a lean team of four engineers demands efficiency and powerful tooling.

The IT story behind 911 emergency services

At 2:37am on a cold Oregon night, a fire alarm blared at a rural station. Seconds later, the call came in: a structure fire on the outskirts of Rogue Valley. But what if that alarm never reached the station? This isn't a hypothetical. For the IT team at Emergency Communications of Southern Oregon (ECSO 911), it’s the kind of emergency scenario they prepare for every day.

How HireVue Turned Cloud Cost Chaos Into A Competitive Edge

When you’re a global leader in AI-assisted hiring, speed matters. Not just in matching candidates to jobs, but in making the engineering and financial decisions that keep your platform running efficiently. For HireVue, fragmented infrastructure, manual processes, and sprawling spreadsheets turned cloud cost management into a time-consuming spelunking expedition.