Operations | Monitoring | ITSM | DevOps | Cloud

An In-Depth Guide to Java Performance Monitoring for SREs

If you've ever had a Java application slow down in production and struggled to pinpoint the cause, you know the pain of performance issues. Java is a powerful, high-level language, but it doesn’t come without challenges—especially when it comes to resource management, garbage collection, and thread handling. This guide will take you through everything you need to know about Java performance monitoring, from key metrics to tools and best practices.

CLM Chowder: Digging Into the Cloud Latency of Azure, Google Cloud, and OCI

CLM Chowder is a new series which highlights notable observations of cloud connectivity surfaced by Kentik’s Cloud Latency Map. In this edition, we look at measurements from Alibaba (China), latency swings from South Africa, and a temporary latency jump from Marseilles to Asia.

The next generation of Grafana Mimir: Inside Mimir's redesigned architecture for increased reliability

This year Grafana Mimir — the open source, horizontally scalable, multi-tenant time series database (TSDB) — will celebrate its third anniversary. Over the years, Mimir has become the go-to, Prometheus-compatible metrics backend within the open source community, with 29 maintainers and more than 4.6k GitHub stars. Since introducing Mimir, we’ve worked hard to deliver on our promise of making it the most scalable and performant open source TSDB in the world.

Grafana Drilldown apps: the improved queryless experience formerly known as the Explore apps

When we introduced the Explore apps suite for metrics, logs, traces, and profiles last year at ObservabilityCON 2024, our goal was simple: offer a queryless, point-and-click experience so you can quickly find insights in your observability data—no queries or complicated syntax required. Our commitment to that goal remains unchanged, but we’re excited to announce that the Explore apps have a new name: Grafana Drilldown.

Intelligent Alerting with RapidSpike and ilert Integration

When it comes to website performance and uptime, every second counts. Businesses rely on tools like RapidSpike to monitor their digital presence, ensuring websites and applications run smoothly. However, effective alerting and incident management are just as critical as monitoring itself. That’s where ilert comes in.

DORA Compliance - An Opportunity for MSPs

For Managed Service Providers (MSPs) in the EU, who serve financial organizations, DORA regulatory compliance is a hot topic. The DORA (Digital Operational Resilience Act) is a new regulation that came into force on Jan 17th, 2025, aimed at ensuring the operational resilience of financial entities in the EU, focusing on technology risk management and minimizing disruptions in critical services.

New Integration: ilert + RapidSpike for Proactive Website Monitoring

We are pleased to announce a new inbound integration in the ilert catalog: RapidSpike. This integration enhances incident management by connecting ilert with RapidSpike’s website monitoring capabilities, ensuring teams receive real-time alerts on website performance, uptime, and security threats.

5 Things We Learned from the Latest Public Sector Cybersecurity Report

Marketing Connections has published the Next-Gen Government IT: AI and Observability Insights Report in partnership with SolarWinds. The survey targeted 200 public sector IT decision-makers and influencers in the US and 100 of their counterparts in the UK. Here are five things we learned.

What Nature Can Teach Us About Alert Fatigue

Alert fatigue is a pervasive challenge in modern IT environments. When teams are inundated with false positives or low-priority notifications, it’s easy to lose sight of real issues. To kick off our blog series on the 5 most common obstacles to observability in 2025, let’s discuss the headache of alert fatigue and how insights from the natural world can offer answers.