Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

VictoriaMetrics Components: Getting Started

This article introduces the key components of VictoriaMetrics and explains how they work together as part of a complete monitoring system. VictoriaMetrics is a top-tier monitoring solution known for its speed and low-resource consumption. It includes components for monitoring, alerting, data visualization, querying, scraping, incremental backups, and more.

Windows Monitoring with Sysmon: Practical Guide and Configuration

One might think that, considering how effective some companies are at logging everything we do to serve us ads, they’d at least apply that to help us understand what’s happening on our systems and monitor their performance and security. But in the case of Windows, traditional logs fall short — and that’s where the importance of Sysmon comes in. Sysmon is a Windows service that logs operating system activity into the event log.

An ultimate step-by-step guide on Checkmk Cloud Monitoring

Checkmk launched Checkmk Cloud (SaaS) in February 2025, which is a fully managed, cloud-based version of their monitoring technology. This solution, designed for ease of use, allows enterprises to start monitoring their IT infrastructure with no installation, maintenance, or manual upgrades required. The SaaS version is compatible with both cloud-based and on-premises systems, bringing them together under a single, straightforward platform.

The Best Open-Source Dashboard Tools for 2025: Expert Guide to Choosing the Right One

Table of Contents In today’s digital operations, dashboards aren’t just nice-to-haves—they’re essential. Teams across engineering, product, operations, and business intelligence rely on real-time data visibility to monitor systems, analyze trends, and catch anomalies before they escalate. For many organizations, open-source dashboard tools offer the best combination of flexibility, transparency, and cost-efficiency.

It's not just about fixing problems, it's about detecting them before they escalate.

IT teams can’t solve what they can’t see. Undetected issues impacting end users lead to lost revenue, brand reputation damage, and frustrated customers. That’s why proactive monitoring is critical. By simulating end-user experiences, you catch small issues before they snowball into major incidents—saving time, money, and operational headaches.

What Is a Network Assessment, and What Is a Network Audit?

These days, networks are larger and more complex than ever. It’s all too easy to fall short when managing performance, security, and compliance. That’s where network assessments and network audits can help. Both network assessments and network audits can give you a more comprehensive understanding of your network and its current strengths, weaknesses, and threats. As a result, you can quickly identify and resolve issues.

Top 3 tools for DORA metrics reporting: SquaredUp vs Power BI vs Jira

What is it that makes a high-performing software engineering team successful? This was the challenge undertaken by the DevOps Research and Assessment (DORA) team around 2015, who created a set of metrics that could provide a reliable, data-driven way to measure and improve software delivery performance.

Meta-monitoring Loki (Loki Community Call May 2025)

In this Loki Community Call, we talk about the need for meta-monitoring Loki: why Loki needs to be monitored, what to watch out for, and how to do it. We talk about different ways to get information from Loki that allow you to make it reliable, consistent, and performant, including a Helm chart to deploy a meta-monitoring stack on Kubernetes. We discuss the Loki mixin for Grafana and how to use it to visualize data about Loki. On the call are Jay Clifford, Nicole van der Hoeven, and Dylan Guedes from Grafana Labs.

Cloud quotas: How to make cloud management easy

In the past, a cloud architect's pain point was usually deciding between these two options: To tackle this confusion, major cloud service providers (CSPs) launched quotas (in their own words). To give you examples, here are the different terminologies used by the three major public CSPs: The main ingredient of a well-oiled cloud setup that significantly impacts cloud operations is understanding and managing cloud quotas, also known as service quotas.