Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Achieving Comprehensive Network Observability for VMware Cloud Foundation

Private cloud infrastructure adoption is accelerating rapidly. This move is driven by the ongoing “cloud reset” as leaders rethink their hybrid and multi-cloud strategies, seeking greater control, security, and flexibility for their IT workloads. As a matter of fact, leaders in 69% of organizations are considering repatriating workloads, and one-third already have.

Data points per minute in Grafana Cloud: What you need to know about DPM

If you’re working with metrics in Grafana Cloud, chances are you’ve come across DPM (data points per minute). It shows up in usage dashboards, invoice breakdowns, and occasionally pops up in Slack when your ingestion numbers start looking suspicious. DPM can also be seen in the Grafana Cloud billing and usage dashboard, which is available by default in every Grafana Cloud account. It helps you understand how much data you’re sending—and whether it’s more than you need.

5 Ways to Optimize Your OpenSearch Cluster

OpenSearch is a powerful, scalable search and analytics engine that can do amazing things for logging, observability, and full-text search. But like any distributed system, it only performs well if you keep it properly tuned and healthy. Ignore it, and you risk slower queries, higher costs, and even data loss. Here are five practical tips to keep your OpenSearch cluster running smoothly and efficiently.

Guided by Trust: ScienceLogic Earns TrustRadius Top Rated for the Sixth Year Running

In a world where IT complexity is accelerating, trust has never been more essential. At ScienceLogic, trust isn’t just a value—it’s our compass. It guides how we innovate, how we serve, and how we grow alongside our customers. That’s why we’re proud to share that ScienceLogic SL1 has once again been named a Top Rated product on TrustRadius—for the sixth consecutive year. This recognition is more than a milestone.

An Easy Guide to Getting Started with Elastic APM

Code in production will break. Maybe a request takes too long, maybe it fails quietly, or maybe it works fine one minute and falls over the next. Logs can help, sure—but they don’t always show the full picture, especially when performance issues are involved. Elastic APM gives you a clearer view. It traces what your application is doing from incoming requests to database queries and everything in between.

No Sandwich, No Security: What This Week's Lunch Taught Me About DNS Blind Spots

Like many shoppers in the UK this week, I found myself staring at half-empty shelves in my local grocery store. In a small but frustrating twist, my usual sandwich, chicken mayo on malted bread, was nowhere to be found. The disruption wasn’t just about lunchtime preferences; it was part of a broader impact from cyberattacks that hit major UK retailers, including Co-op and Marks & Spencer.

How to Configure Lightweight Browser Tracing for Debugging at Scale

Sentry’s auto-instrumentation, using BrowserTracing, is convenient. You can get interesting insights about your frontend application out-of-the-box, such as whether slow and failing API calls are hurting your user experience (summarized in Network Requests), or how your website stacks up against industry standards for performance (summarized in Web Vitals).

Introducing Bits AI SRE, your AI on-call teammate

Getting paged pulls engineers away from meaningful work, yet incident response in many organizations remains manual, reactive, and draining. An alert fires and teams scramble to find the root cause, relying on siloed knowledge, incomplete context, and a few on-call experts who are already stretched thin. The rise of AI coding agents has only intensified this challenge: As teams ship code faster with less human oversight, production systems grow increasingly complex and harder to understand.