Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Observabilty for complex systems and related technologies.

KubeCon 2024 | Interviews with Observability Experts | Observability Insights with Josh Lee

Join me at KubeCon 2024 as I sit down with Josh Lee, Developer Advocate at Altinity, to discuss the latest trends, challenges, and insights in observability. In this interview, we cover key topics such as OpenTelemetry adoption (including the Open Agent Management Protocol), data sovereignty, standardization through semantic conventions, and the need to unify observability tooling across organizations.

Fast-Track Kubernetes Observability with Logz.io and OpenTelemetry: A quick getting started guide

In formal terms, OpenTelemetry is an open source framework used for instrumenting, generating, collecting, and exporting telemetry data for applications, services, and infrastructure. It provides vendor-neutral tools, SDKs and APIs for generating, collecting, and exporting telemetry data such as traces, metrics, and logs to any observability backend, including both open source and commercial tools.

Top Dynatrace Competitors and Alternatives for Modern Observability in 2025

Observability tools are crucial for maintaining the seamless performance and reliability of systems. Dynatrace has been one of the leading solutions for monitoring and observability over the past few years. However, there are many alternatives that provide similar features, often at more accessible price points and with unique capabilities. In this article, we will explore the best Dynatrace alternatives for 2025 to help you find the right fit for your organization.

SolarWinds Network and Infrastructure Observability

SolarWinds observability helps IT teams gain complete visibility across on-prem and cloud environments. Monitor everything from physical servers to AWS, Azure, and Kubernetes with real-time insights and traffic flow analysis. Quickly identify and resolve issues to optimize performance, simplify workflows, and reduce downtime. Get the unified visibility you need with SolarWinds—wherever you need IT.

New Relic Cost Optimization: 9 Surefire Ways To Cut Your Observability Costs

New Relic has established itself as a top observability platform with full-stack monitoring. Unifying all telemetry data — metrics, events, logs, and traces — into one platform delivers deep performance insights and enables faster troubleshooting without juggling multiple tools. Also, New Relic prioritizes developers with tools like CodeStream, integrating error details and telemetry directly into the IDE.

AIOps: Prove It!

I’ve read a steadily increasing stream of articles about using AI in SRE, and I have yet to find one that inspires my trust. Each article makes impressive claims about the capabilities of AI and the way it can be applied to SRE tasks, but the vast majority are light on details. AI tools, and especially LLMs, are growing incredibly quickly, and I feel that these tools have a ton of potential.

Understanding Observability, Monitoring, and Telemetry Differences

In the area of IT infrastructure management, three terms often surface: observability, monitoring, and telemetry. These concepts, while interconnected, each play a unique role in maintaining system health and performance. Observability, monitoring, and telemetry form the backbone of any robust IT environment. Yet, their differences and interrelations can sometimes blur, leading to confusion. This article aims to demystify these terms, providing clarity on their distinct roles and how they work together.

Open source log management tools in 2025

Log management tools provide visibility into the performance and behavior of systems, applications, networks, and infrastructure components. By collecting and analyzing logs, you can monitor for anomalies, track trends, and identify potential issues before they escalate. Choosing the right log management solution requires careful consideration of several factors to ensure that it meets your specific needs and goals. Here are the most popular open source log management tools to help you choose.

Microservices Aren't the Goal: What we Check Before Splitting a Monolith

Most "we should move to microservices" conversations start as architecture debates, but they're almost always driven by operational pain. Releases feel fragile. Incidents take longer to diagnose. Scaling one busy area means scaling everything. Coordination costs grow faster than the product. Over time, we've learned to treat microservices as a tool that you pick to remove a specific constraint, not as a badge of maturity. The most useful starting question is blunt: what outcome is the current architecture blocking today, and is distribution really the cheapest way to unlock it?