Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Certificate Rotation with Progress-powered Solutions

Don’t let expired certificates put your organization at risk! Progress WhatsUp Gold makes it easy to discover, manage, and automate certificate lifecycles across your network. With powerful automation from Progress Infrastructure solutions, you can rotate and manage certificates without a manual routine to maintain compliance and security. Schedule updates, push certificates to thousands of nodes and maintain governance with built-in traceability. Experience simplicity, scalability and seamless integration with Progress-powered solutions.

Mute timing vs. silences in Grafana Alerting: How to pick the best fit for your use case

Have you ever been in a situation where know your team is going to run their weekly maintenance window and you silence your notifications to prevent a flood of false positives from pinging your inbox? If you are associated with a team that uses any type of alert system, you know how easily alert fatigue can happen. The incessant and unpredictable (or even, at times, predictable) pings, emails, and notification alerts can drive even the most serene worker totally batty.

How to Push Prometheus Metrics to Splunk Observability Cloud with the OpenTelemetry Collector

In this video, you’ll learn how to scrape Prometheus endpoints with the OpenTelemetry Collector’s Prometheus receiver and send metrics to Splunk Observability Cloud. We’ll walk through configuring three common data sources (a Python Flask app, node_exporter for host metrics, and the NGINX Prometheus exporter), show how to enrich metrics with resource attributes, and build simple charts in Splunk Observability Cloud. You’ll see how centralized scraping and consistent tagging make it easy to manage and visualize Prometheus metrics in Splunk Observability Cloud.

Monitor and optimize your systems with Uptrace

Uptrace is your single source of truth for monitoring, understanding, and optimizing complex distributed systems. Proven in production for over five years and trusted by more than a thousand installations worldwide, it lets you see your system like never before. What makes the difference is that Uptrace is pure OpenTelemetry, built natively from day one. This isn't a translation layer—it's a direct connection that eliminates friction and ensures zero vendor lock-in. Your homepage serves as your command center, providing complete visibility across your stack at a glance.

Distinct Value Cache in InfluxDB 3

The Distinct Value Cache in InfluxDB 3 speeds up metadata queries and tag value lookups for faster, more responsive UIs. The Distinct Value Cache in InfluxDB 3 delivers sub-30 ms lookups for tag values and series metadata, making exploratory queries and UI dropdowns quick and responsive. By reducing latency on these common operations, it allows developers to build real-time monitoring and analytics tools without extra complexity.

What is SNMP Trap: Real-Time Alerts for Network Monitoring

Why wait for the next poll? An SNMP trap is a real-time alert sent from a device to a monitoring system, without waiting for polling. Ever had a router die silently at 3 AM while your monitoring system was still polling away every 5 minutes? Yeah… not fun. That’s where SNMP traps step in. Think of them as the push notifications of network monitoring: instant, lightweight, and sometimes misunderstood.

An overview of Context Propagation in OpenTelemetry

To effectively manage modern applications, you need to understand how they work on the inside. Distributed tracing is the key to this, providing a detailed picture of a request's journey across every service. OpenTelemetry has emerged as the industry-standard framework for implementing tracing and achieving true observability in complex, distributed systems. In this article, we embark on a journey to explore the core concept of context propagation within Open Telemetry.

Two Decades of Microsoft SCOM & Monitoring Expertise

In today’s complex IT environments, reliable monitoring isn’t optional — it’s essential. From critical infrastructure in Government & Defense to highly regulated sectors like Healthcare, Energy, and Finance, organizations worldwide trust NiCE to deliver secure, future-ready monitoring solutions.

Your Next Observability RFP is All Wrong. Why AI Changes Everything

AI-first observability addresses two of the most pressing troubleshooting challenges: complex IT environments and AI-generated code. But understanding how to implement AI in a way that brings ROI, requires cutting through the hype and maintaining realistic expectations, while keeping a forward-thinking vision. In this blog post, we bring practical tips for including AI in your next observability RFP. The article is based on a webinar held with Logz.io founders, CEO Tomer Levy and CTO Asaf Yigal.