Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Introduction to Apache Kafka Scaling Challenges

Apache Kafka has become the go-to platform for organizations handling high-throughput, real-time data streaming. Its ability to manage massive data volumes while ensuring reliability is second to none. However, as businesses grow and demand for data increases, scaling Apache Kafka isn’t always a walk in the park.

Coralogix Expands AWS Partnership to Deliver AI-Driven Observability and Edge Threat Detection

Coralogix is proud to announce a new phase in its partnership with AWS through a Strategic Collaboration Agreement (SCA) focused on bringing AI-powered observability and security to the enterprise. At the heart of this collaboration is Amazon Bedrock, AWS’s managed service for foundation models.

Global API downtime increases by 60% in 2025, new data shows

London, 8 July 2025: Global API downtime increased by 60% in Q1 2025 compared to Q1 2024, shows new data from web service monitoring provider Uptrends, part of ITRS’ comprehensive observability platform. The State of API Reliability 2025 report — based on over 2 billion API monitoring checks across 20 industries in Q1 2024 and Q1 2025 — reveals a year-on-year drop in average API uptime from 99.66% to 99.46%, representing a decline of 0.2%.

Announcing Checkly Uptime Monitors: Simple, Scalable, and Built for Developers

When Checkly launched, it was the first of its kind, enabling developers to monitor complex workflows easier than ever using the automation tooling (Playwright, Terraform, etc) they already knew and loved. We’ve helped detect and resolve issues for 1000s of companies—ranging from monitoring crucial log-ins, to purchasing products, to setting up client instances for millions of monthly users But what about the simpler stuff?

The Defense-in-Depth Approach To Application Monitoring

In cybersecurity, defense-in-depth is a fundamental principle – you never rely on a single security measure to protect your systems. The same philosophy applies to application monitoring. No single monitoring approach, no matter how sophisticated, can capture every possible failure mode of your application. This is why layered monitoring isn't just a best practice – it's essential risk mitigation.

Introducing MetricFire Logging: Visualize Logs Alongside Metrics

As modern infrastructure grows more dynamic and distributed, collecting logs alongside metrics becomes a critical part of any observability strategy. To make this easy and powerful, MetricFire now supports a direct logging pipeline using Grafana Loki. This allows you to forward system logs from your servers to Hosted Graphite's Loki backend and visualize them in your Hosted Grafana dashboards with full control over queries, filtering, and alerting.

We now support Google Chat

I'm pleased to share that we've can now notify you via Google Chat. Here's what that looks like: Our Google Chat notifications include: You can read more on how to set up Google Chat notifications in our docs. Of course, we also offer numerous other channels to notify you when something is wrong with your site. I'm pleased to share that we've can now notify you via Google Chat.

Introduction to Kafka Scaling Challenges

Apache Kafka has become the go-to platform for organizations handling high-throughput, real-time data streaming. Its ability to manage massive data volumes while ensuring reliability is second to none. However, as businesses grow and demand for data increases, scaling Kafka isn’t always a walk in the park. It often comes with its own set of challenges that can throw even the most seasoned teams for a loop.

How They Handle 44 Million Searches a Day...Without Breaking! | Rightmove and Elastic

Rightmove, the UK's number one property search, and buying and selling platform has trusted Elastic for more than 11 years. Hear Andrei Nicusan, Principal Engineer at Rightmove on why Elastic has been Rightmove's number one Search and Observability solution for more than a decade. And now with the move to Elastic Cloud and Google Cloud Platform, you can find out how Rightmove are taking advantage of reductions in their infrastructure overheads too!