
The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Webinar: Petabyte Scale, Gigabyte Costs: Mezmo's ElasticSearch to Quickwit Evolution

Many engineering teams rely on ElasticSearch for search and analytics, but as data volumes grow, so do the challenges of scale, cost, and performance. At Mezmo, we faced this reality head-on, recognizing the need for a more efficient and scalable solution to support our multi-cluster, multi-petabyte telemetry data backend. After extensive evaluation, we made the leap to Quickwit, an open-source, cloud-native search engine for logs. But making such a fundamental architectural shift—without disrupting customers—was no small feat.

G2 Names Progress WhatsUp Gold a Leader in Network Traffic Analysis Grid Report

G2 has unveiled the leaders in the Network Traffic Analysis Grid Report, and the Progress WhatsUp Gold solution is one of them. More than 100 G2 users have indicated that they are satisfied with WhatsUp Gold Network Traffic Analysis (NTA) and its numerous other features. The report states that 88% of users would highly recommend the WhatsUp Gold solution. G2's quarterly reports identify the leaders in particular technology sectors.

What MSPs Need to Know About ISO 27001 Compliance in 2025

In today’s evolving cybersecurity landscape, managed service providers (MSPs) play a critical role in ensuring their clients’ IT environments remain secure, compliant, and resilient. One of the most widely recognized global standards for information security management is ISO 27001—a framework that establishes best practices for managing security risks and protecting sensitive data.

Practical Tips on Handling Errors and Exceptions in Python

Have you ever encountered a confusing error message that left you wondering what went wrong in your Python code? You’re not alone. Even the most experienced developers run into exceptions, making it essential to understand how to handle them effectively. While basic syntax errors can be caught early by code editors and debugging tools, more complex issues often arise at runtime, requiring a structured approach to exception handling.
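As a rough illustration of what a structured approach to runtime errors can look like, here is a minimal sketch that is not taken from the article itself. The function name load_settings and its fallback behavior are assumptions chosen for the example; only the standard-library json and logging modules are used.

```python
import json
import logging
from typing import Optional

logger = logging.getLogger(__name__)


def load_settings(text: str, default: Optional[dict] = None) -> dict:
    """Parse JSON settings (hypothetical helper for illustration).

    Malformed JSON is treated as recoverable and falls back to `default`.
    Valid JSON of the wrong shape is treated as a caller bug and raised.
    """
    try:
        settings = json.loads(text)
    except json.JSONDecodeError as exc:
        # Catch only the specific error we know how to recover from.
        logger.warning("Could not parse settings: %s", exc)
        return dict(default or {})
    else:
        # Runs only when no exception was raised in the try block.
        if not isinstance(settings, dict):
            raise TypeError("settings must be a JSON object")
        return settings
    finally:
        # Runs on every path, whether we returned or raised.
        logger.debug("load_settings finished")
```

Calling load_settings('{"a": 1}') returns the parsed dictionary, while load_settings('not json', {"a": 2}) logs a warning and returns a copy of the default. Catching json.JSONDecodeError rather than a bare Exception keeps unrelated bugs visible instead of silently swallowing them.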
Sponsored Post

The Top 5 Security Logging Best Practices to Follow Now

Security logging is a critical part of modern cybersecurity, providing the foundation for detecting, analyzing, and responding to potential threats. As highlighted by OWASP, security logging and monitoring failures can lead to undetected security breaches. With the average cost of a data breach at $4.45 million, most organizations can't afford to miss a security incident.

Finding UX Friction (...Before It Becomes a Problem)

Make it smooth. Reduce friction. Keep users moving. That’s solid advice. No one enjoys filling out a form with 10 unnecessary fields or dealing with a checkout process that feels like a maze. But you can’t fix friction if you don’t know where it’s happening. Big companies like Amazon, Netflix, and Airbnb don’t just guess where users are struggling. They track the right UX metrics, run experiments, and fine-tune their products constantly.

Coroot v1.9: Kubernetes-Native Database Monitoring Made Easy

From day one, we built Coroot to work beyond just Kubernetes. Many teams still run databases and other stateful services on dedicated VMs or bare-metal servers. But that’s starting to change. More and more teams no longer see Kubernetes as a platform just for stateless apps. Powerful Kubernetes operators now handle day-2 operations like failover, backups, and disaster recovery—making it easier than ever to run databases on Kubernetes. And the number of teams choosing this path keeps growing.

Optimizing Item Search: How Rollbar Engineered Faster, More Capable Search

Searching through error data efficiently is critical for developers using monitoring tools. At Rollbar, we recently completed a significant overhaul of our Item Search backend. The previous system faced performance limitations and constraints on search capabilities. This post details the technical challenges, the architectural changes we implemented, and the resulting performance gains.

Is It Time to Switch Your Network Monitoring Tool? How to Know & Choose the Right Upgrade

A while ago, your company chose a network monitoring tool that worked perfectly, back when most employees worked in the office, networks were centralized, applications ran on-premises, and "the cloud" was just a buzzword.

Simplifying public sector observability with OpenTelemetry and Elastic

Public sector organizations today face unique challenges in maintaining and optimizing their IT infrastructure while prioritizing efficiency and interoperability. With a mix of modern cloud and legacy systems, ensuring consistent performance, reliability, and security is paramount. To observe effectively across these environments, government agencies need observability tools that are open, flexible, and scalable. OpenTelemetry (OTel) is fast becoming a pivotal part of that flexible toolset.