Track the Right Elasticsearch Metrics Without the Noise

Elasticsearch does a lot right: it’s fast, scalable, and makes searches feel simple. But when things slow down or break, figuring out what’s going on can be frustrating, especially if you’re not keeping an eye on the right metrics. This guide covers the Elasticsearch metrics that are worth tracking and how they help you keep your cluster healthy without data overload.

OpenTelemetry vs Micrometer: Here's How to Decide

In a distributed system, things break in unexpected ways. That’s why observability isn’t optional—it’s how you understand what’s going on under the hood. If you’re comparing tools to instrument your services, OpenTelemetry and Micrometer are two names you’ll run into. Both are used to collect metrics, but they take very different approaches—especially when it comes to flexibility, vendor support, and what you can do with the data.

Using the OpenTelemetry Operator to boost your observability

If you’ve ever wrangled sidecars or sprinkled instrumentation code just to get basic trace data, you know the setup overhead isn’t always worth the payoff. But what if it was… just easier? That’s where the OpenTelemetry Operator for Kubernetes steps in… and it plays great with Coralogix out of the box!

Real-time detection of BGP blackholing and prefix hijacks

Border Gateway Protocol (BGP) remains the backbone of inter-domain routing on the Internet, but its fundamental trust model leaves it vulnerable to misconfigurations, hijacks, and blackholing. When these issues occur, they often go undetected by the impacted networks—until users report degraded performance or service outages. This post walks through a real-world incident in which a legitimate traffic spike led to an upstream provider mistakenly blackholing a critical IP address.

Port + Komodor: Bringing Kubernetes Visibility into the Modern Commercial IDP

Internal Developer Portals (IDPs) are no longer just an experimental concept—they’re now a foundational component of modern software delivery. As engineering organizations look to reduce cognitive load, increase self-service, and streamline infrastructure workflows, IDPs have emerged as the most effective way to productize platform engineering.

How to Troubleshoot Faster with LM Logs

When an alert fires, your goal is clear: fix the problem, fast. But traditional troubleshooting rarely makes that easy. You’re immediately thrown into decision mode, and all the while, the clock is ticking. The longer you’re stuck guessing what to do next, the longer your downtime drags on, and the more non-value-added engineering time you burn.

Rollbar and ilert: Real-time error monitoring meets smart incident response

We’re excited to share that Rollbar is now part of the ilert integration catalog! This new technical partnership allows software teams to detect application errors in real time with Rollbar and instantly respond using ilert’s powerful alerting and incident management features.

Boost your Android development with remote app testing via Anbox Cloud

In today’s hectic app development cycles, speed is key. But as teams scale and spread worldwide, and app complexity increases, traditional testing workflows become bottlenecks that can compromise security, increase operational costs, and delay product delivery. Simply sharing APKs or configuring physical test devices isn’t enough anymore – it’s time to rethink how we test and demo Android apps.

Understand and manage your Datadog spend with Datadog cost data in Cloud Cost Management

As your organization scales its Datadog footprint, you want to understand what’s driving cost changes and promote cost awareness. But to take meaningful action, you need more than a monthly bill—you need real-time, contextualized cost data tied to services and teams. Without this visibility, it’s hard to assign ownership, prevent cost overruns, or identify which changes are affecting spend.