Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

What is Digital Adoption? Strategies for 2025

In today’s digital-first workplace, it’s not enough to deploy new software. You need your teams to actually use it. That’s where digital adoption comes in. Digital adoption is the process by which individuals not only learn how to use digital tools but also integrate them into their day-to-day tasks in a way that enhances performance. True digital adoption means employees are using the right features, in the right context, to complete work with minimal friction and maximum confidence.

Common Issues with Grafana Login and How to Fix Them

Grafana is a popular choice for monitoring and visualizing metrics, but login issues can quickly block your access and slow you down. Forgot your password? Can’t get into the admin account? Problems after changing authentication settings? These are some of the most common hiccups—and they’re usually easy to fix. This guide covers the frequent login problems you might face and walks you through practical ways to resolve them.

Track the Right Elasticsearch Metrics Without the Noise

Elasticsearch does a lot right—it's fast, scalable, and makes searches feel simple. But when things slow down or break, figuring out what’s going on can be frustrating. Especially if you’re not keeping an eye on the right metrics. This guide covers Elasticsearch metrics that are worth tracking and how they help you keep your cluster healthy without data overload.

OpenTelemetry vs Micrometer: Here's How to Decide

In a distributed system, things break in unexpected ways. That’s why observability isn’t optional—it’s how you understand what’s going on under the hood. If you’re comparing tools to instrument your services, OpenTelemetry and Micrometer are two names you’ll run into. Both are used to collect metrics, but they take very different approaches—especially when it comes to flexibility, vendor support, and what you can do with the data.

Using the OpenTelemetry Operator to boost your observability

If you’ve ever wrangled sidecars or sprinkled instrumentation code just to get basic trace data, you know the setup overhead isn’t always worth the payoff. But what if it was… just easier? That’s where the OpenTelemetry Operator for Kubernetes steps in… and it plays great with Coralogix out of the box!

Real-time detection of BGP blackholing and prefix hijacks

Border Gateway Protocol (BGP) remains the backbone of inter-domain routing on the Internet, but its fundamental trust model leaves it vulnerable to misconfigurations, hijacks, and blackholing. When these issues occur, they often go undetected by the impacted networks—until users report degraded performance or service outages. This post walks through a real-world incident in which a legitimate traffic spike led to an upstream provider mistakenly blackholing a critical IP address.

How to Troubleshoot Faster with LM Logs

When an alert fires, your goal is clear: fix the problem—fast. But traditional troubleshooting rarely makes that easy. You’re immediately thrown into decision mode: All the while, the clock is ticking. The longer you’re stuck guessing what to do next, the longer your downtime drags on, and the more non-value-added engineering time you burn.

Understand and manage your Datadog spend with Datadog cost data in Cloud Cost Management

As your organization scales its Datadog footprint, you want to understand what’s driving cost changes and promote cost awareness. But to take meaningful action, you need more than a monthly bill—you need real-time, contextualized cost data tied to services and teams. Without this visibility, it’s hard to assign ownership, prevent cost overruns, or identify which changes are affecting spend.