The latest news and information on monitoring for websites, applications, APIs, infrastructure, and other technologies.

PostgreSQL Performance: Faster Queries and Better Throughput

A PostgreSQL setup that performed well with 10,000 users starts to show strain at 100,000. Queries that once returned in under 50ms now take over 2 seconds. The connection pool regularly hits its limit during peak usage, leading to timeouts and degraded performance. This blog focuses on practical ways to reduce query latency by 50–80% and increase throughput for high-concurrency environments.

Leaning into AI, ML, and observability to manage your ever-growing infrastructure

The complexity and scale of modern infrastructure require an equally intelligent set of observability tools to monitor it effectively. Remember when scaling meant ordering new servers and racking them in a data center? Remember when cloud providers first offered access to seemingly infinite virtual machines at the click of a button? Remember when Kubernetes made it trivial for infrastructure to automatically scale itself based on demand?

New Feature - Vulnerable System Drivers Monitoring

Vulnerable system drivers continue to be a vector exploited by attackers to compromise systems. In eG Enterprise version 7.5 we added a number of periodic security checks to help administrators proactively identify weaknesses, including vulnerable system drivers monitoring. This new capability is supported for Windows OS when using a VM agent for inside view monitoring and/or when monitoring an Azure Virtual Desktop session host.

Coralogix SLO Center & SLO Alerts are now available

Coralogix has released a new flagship service management product, the SLO Center. The SLO Center allows customers to define service level objectives (SLOs) for their teams. SLOs can be defined across multiple services or metric streams. Powered by the Coralogix Streama engine, this unlocks full coverage SLOs for every team, regardless of volume and with very high cardinality limits.

Coralogix becomes first observability vendor to earn ISO/IEC 42001:2023 certification for responsible AI

We’re proud to announce that Coralogix is now officially ISO/IEC 42001:2023 certified, becoming the first observability vendor to achieve this globally recognized standard for responsible AI management. ISO/IEC 42001:2023 is the world’s first international standard for Artificial Intelligence Management Systems (AIMS). It provides a comprehensive framework for how organizations should govern AI, focusing on transparency, ethical use, accountability, and regulatory compliance.

The Outage You Can't Afford: Why CMI/CME Providers Need Autonomous Operations Now

Imagine if degrading network performance—not just bad code—disrupted your live stream during a high-profile event. Customers start flooding support lines. Social media lights up. Your NOC teams scramble to identify the root cause amid fragmented systems. The outage impacts not only your broadcast, but also subscriber logins, ad delivery, and mobile apps. Advertisers want refunds. Executives ask, “Why didn’t we see this coming?”

Introducing Cribl Guard

Does sensitive data flowing through your network feel like a ticking time bomb? Well, it just might be. Legal mandates, security frameworks, and customer expectations have made the stakes higher than ever. One leaked spreadsheet of personally identifiable information (PII) can wipe out years of customer trust, rack up regulatory fines, and invite ransomware actors to your doorstep.

Domain Expiry and Its Impact on SEO: How to Monitor and Prevent Lapses

Your domain name is your digital real estate. It is how customers find you, search engines rank you, and your brand builds trust in the digital world. Whether you run a small blog, an e-commerce store, or a large business, your domain is the foundation of your online activities. But what happens if you forget to renew it? A domain expiry can cause your site to go offline. It can also hurt your SEO rankings and affect your website traffic.

Building on the foundation of OpenTelemetry eBPF Instrumentation: what's new in Grafana Beyla 2.5

Earlier this year, Grafana Labs donated Grafana Beyla — our open source eBPF-based, zero-code instrumentation tool — to OpenTelemetry under the new project name OpenTelemetry eBPF Instrumentation. In addition to reflecting our deep and long-standing commitment to the OpenTelemetry project, the donation marked a significant milestone in the evolution of zero-code eBPF instrumentation within the open source community at large.

Save Hours on Troubleshooting with Automated Investigations

How many times has your team stared at a dashboard, pointed to a spike, and asked a question that charts alone can’t answer? “What was the real impact of that deployment?” “Why are our Kubernetes pods in the us-east-1 cluster suddenly crashing?” “Are we wasting money on overprovisioned servers?” Answering these questions is the real work of operations and SRE.