Operations | Monitoring | ITSM | DevOps | Cloud

How Native Process Automation and Auto-Remediation Drive Operational Excellence

This is the second post in a series examining the requirements necessary to achieve operational excellence. Did you miss the first post? You can find it here. Maintaining continuous uptime and resolving issues swiftly has never been more critical in the rapidly changing digital operations landscape. Automation must become the industry standard, yet the distinction between native process automation and reliance on external tools has a significant impact on operational efficiency and responsiveness.

We now support Google Chat

I'm pleased to share that we've can now notify you via Google Chat. Here's what that looks like: Our Google Chat notifications include: You can read more on how to set up Google Chat notifications in our docs. Of course, we also offer numerous other channels to notify you when something is wrong with your site. I'm pleased to share that we've can now notify you via Google Chat.

Introduction to Kafka Scaling Challenges

Apache Kafka has become the go-to platform for organizations handling high-throughput, real-time data streaming. Its ability to manage massive data volumes while ensuring reliability is second to none. However, as businesses grow and demand for data increases, scaling Kafka isn’t always a walk in the park. It often comes with its own set of challenges that can throw even the most seasoned teams for a loop.

Resolve COO, Ari Stowe speaks at ONUG AI Networking Summit 2025 #itautomation #agenticai #ai #tech

Our COO Ari Stowe spoke at @onugcommunity's AI Networking Summit on how AI and Zero Ticket IT are transforming enterprise IT. From tickets to autonomous resolution—AI, automation, and intelligent agents are changing the game. Hear why AI is now essential in today’s complex IT environments.

How They Handle 44 Million Searches a Day...Without Breaking! | Rightmove and Elastic

Rightmove, the UK's number one property search, and buying and selling platform has trusted Elastic for more than 11 years. Hear Andrei Nicusan, Principal Engineer at Rightmove on why Elastic has been Rightmove's number one Search and Observability solution for more than a decade. And now with the move to Elastic Cloud and Google Cloud Platform, you can find out how Rightmove are taking advantage of reductions in their infrastructure overheads too!

Liquid Cooling vs. Air Cooling: What's Right For Your Data Center?

As power-hungry workloads like AI and HPC become the norm, data centers face mounting pressure to rethink their thermal strategies. Traditional air cooling has long been the industry standard, but with rising rack densities and energy costs, many operators are exploring liquid cooling as a more efficient alternative. In 2024, the global liquid cooling market was valued around $4.18 billion and is projected to reach $13.2 billion by 2029.

Observability in under 5 seconds: Reflecting on a year of grafana/otel-lgtm

With grafana/otel-lgtm, observability is just one Docker command away. Over the past year, grafana/otel-lgtm has simplified observability setups, helping developers get a complete OpenTelemetry stack running in under five seconds. With integrations for metrics, logs, traces, and now profiles via Grafana Pyroscope, it has become a go-to solution for demos, development, and testing, as evidenced by its growing community (1k stars on GitHub and growing!) and notable adopters.

How a Fortune 500 Company Eliminated 93% of IT Incidents in 72 Hours

Sometimes the biggest transformations begin with what sounds like the worst possible news. One day, this Fortune 500 technology company’s observability platform was running smoothly. The next, they learned their critical monitoring solution would be discontinued as part of a corporate buyout. For a leading global IT vendor in data infrastructure serving customers across storage, cloud, and managed services, this was a potential catastrophe.