Kubernetes Monitoring 101: 25 Tools And Must-Know Tips

The Kubernetes platform is the standard for orchestrating containerized applications. It’s ideal for large applications running on distributed instances. However, monitoring Kubernetes infrastructure can be notoriously challenging. This guide will cover Kubernetes monitoring in more detail, including what metrics to track to improve visibility and control over your K8s containers, apps, microservices, etc.

OpenTelemetry Collector: A Complete Guide [2025]

The OpenTelemetry Collector is a stand-alone service that acts as a powerful, vendor-neutral pipeline for your telemetry data. It can receive, process, and export logs, metrics, and traces, giving you full control over your observability data before it reaches a backend. This guide will provide a comprehensive overview of the OpenTelemetry Collector, its architecture, deployment patterns, and how to configure it for production use.

Notes from the Field: Seamless SSO 404s Impacting Citrix on Windows Server 2025

As a Citrix consultant, not every issue I troubleshoot is directly tied to Citrix, but many of them dramatically impact the end-user experience. This is one of those cases. A customer had begun testing Windows Server 2025 as Multi-Session hosts in their environment. The new servers were domain-joined and fully patched, and they expected a smooth experience with Office 365, Entra ID–backed apps, and cloud-based authentication. Everything had worked flawlessly on Server 2022.

Bringing Intelligence and Automation Together to Change the Shape of Work

The aspirational target state for a cognitive system is to “take responsibility” for a domain (e.g., an autonomous car). To reach that level of sophistication, the system must achieve high levels of maturity simultaneously along two dimensions: Reasoning ability and Automation ability.

Introduction to Apache Kafka Scaling Challenges

Apache Kafka has become the go-to platform for organizations handling high-throughput, real-time data streaming. Its ability to manage massive data volumes while ensuring reliability is second to none. However, as businesses grow and demand for data increases, scaling Apache Kafka isn’t always a walk in the park.

Coralogix Expands AWS Partnership to Deliver AI-Driven Observability and Edge Threat Detection

Coralogix is proud to announce a new phase in its partnership with AWS through a Strategic Collaboration Agreement (SCA) focused on bringing AI-powered observability and security to the enterprise. At the heart of this collaboration is Amazon Bedrock, AWS’s managed service for foundation models.

Got AI Fear? You Shouldn't; It's Coming for Your Busywork, Not Your Job

Artificial Intelligence (AI) has rapidly become a cornerstone of modern IT operations. Yet, despite its transformative potential, many IT professionals harbor apprehensions about integrating AI into their workflows. This growing AI fear, while understandable, often stems from misconceptions and a lack of clarity about AI's role and capabilities. This discussion aims to address and debunk common fears associated with agentic AI.

Global API downtime increases by 60% in 2025, new data shows

London, 8 July 2025: Global API downtime increased by 60% in Q1 2025 compared to Q1 2024, shows new data from web service monitoring provider Uptrends, part of ITRS’ comprehensive observability platform. The State of API Reliability 2025 report, based on over 2 billion API monitoring checks across 20 industries in Q1 2024 and Q1 2025, reveals a year-on-year drop in average API uptime from 99.66% to 99.46%, a decline of 0.2 percentage points.
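
A small drop in uptime and a large rise in downtime can describe the same change, which is easy to miss when reading the two figures side by side. The sketch below is illustrative arithmetic only, not the report's methodology: it takes the two average uptime figures quoted above and works out the relative change in downtime. The function and variable names are hypothetical, and because the quoted averages are rounded, the result lands near the headline 60% rather than exactly on it.

```python
# Illustrative arithmetic only: names are hypothetical, and the inputs are
# the rounded average uptime figures quoted in the teaser above.

def downtime_percent(uptime_percent: float) -> float:
    """Return downtime as a percentage of total time."""
    return 100.0 - uptime_percent

UPTIME_Q1_2024 = 99.66  # average API uptime, Q1 2024 (percent)
UPTIME_Q1_2025 = 99.46  # average API uptime, Q1 2025 (percent)

downtime_before = downtime_percent(UPTIME_Q1_2024)
downtime_after = downtime_percent(UPTIME_Q1_2025)

# Absolute change in uptime, in percentage points.
uptime_drop_points = UPTIME_Q1_2024 - UPTIME_Q1_2025

# Relative change in downtime, in percent of the earlier downtime.
downtime_increase_percent = (
    (downtime_after - downtime_before) / downtime_before * 100.0
)

if __name__ == "__main__":
    print(f"Uptime drop: {uptime_drop_points:.2f} percentage points")
    print(f"Downtime increase: {downtime_increase_percent:.1f}%")
```

Because downtime starts from a small base (about a third of a percent), a shift of a fraction of a percentage point in uptime translates into a much larger relative change in downtime.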

Announcing Checkly Uptime Monitors: Simple, Scalable, and Built for Developers

When Checkly launched, it was the first of its kind, enabling developers to monitor complex workflows more easily than ever using the automation tooling (Playwright, Terraform, etc.) they already knew and loved. We’ve helped detect and resolve issues for thousands of companies, from monitoring crucial log-ins, to purchasing products, to setting up client instances for millions of monthly users. But what about the simpler stuff?

What our users make with Ubuntu Pro - Episode 1

Ubuntu Pro isn’t just for enterprises – it’s for the passionate community that powers and supports open source every day. From secure remote access to homelab hardening, Ubuntu Pro helps users get more from their systems, whether at work or at home. In this series, we talk to real users about how they use Ubuntu Pro in their personal and professional lives. We begin with Marc Grondin, a longtime Linux user and Ubuntu Pro subscriber based in Quebec, Canada.