Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

The Future of Kubernetes Observability

The Kubernetes ecosystem is undergoing a significant transformation, and the trends emerging at KubeCon highlight just how dynamic this space has become. Traditional Application Performance Monitoring (APM) providers are rapidly shifting focus to Kubernetes Performance Monitoring (KPM), reflecting the growing need for specialized observability in increasingly complex environments.

OpenTelemetry - Complete Guide to the Open-Source Observability Framework

In cloud-native environments, observability is key to ensuring the health, performance, and stability of distributed systems. Observability helps developers and operations teams understand how their systems behave in real time, helping diagnose issues, optimize performance, and meet service-level agreements.

How to Fix "Upstream Connect Error" in 7 Different Contexts

The error "upstream connect error or disconnect/reset before headers. reset reason: connection failure" has become a challenge for DevOps teams. This critical error, occurring when services fail to establish or maintain connections with their upstream dependencies, can significantly impact system reliability and user experience.

The future is now, introducing Dynamic Observability from AI innovations built on logs

A year ago, I shared my thoughts at re:Invent, explaining why I joined Sumo Logic as CEO and laid out the importance of logs as a key differentiator. A year later, the atomic level of logs is even more paramount. It’s not just because Sumo Logic is years ahead in technology when it comes to ingesting and analyzing structured and unstructured logs.

From ELK Stack to easy - Elastic Observability on Elastic Cloud Serverless

Announcing the general availability of Elastic Observability on Elastic Cloud Serverless — a fully managed observability solution As organizations scale, an observability solution that can handle the complexity of distributed cloud environments and provide real-time insights often feels like an insurmountable challenge often due to data- and cost-related compromises.

Duolingo: Speaking the Language of Observability with Honeycomb

In the world of digital language learning, Duolingo stands out as a beacon of innovation and user engagement. With millions of users worldwide, their platform is designed not only to teach languages, but also to create a fun and engaging learning experience. Running on the robust AWS cloud infrastructure, Duolingo manages vast amounts of data and user interactions daily. As the company experienced rapid growth, Duolingo remained steadfast in their commitment to delivering a high-quality user experience.

Lightrun Unveils Game-Changing Visual Studio Extension and Dynamic Traces at AWS ReInvent 2024

As we kick off the AWS re:Invent 2024 conference, we’re thrilled to introduce two major developer observability and live debugging advancements that bring even greater power and flexibility to developers and engineering teams everywhere. These new product capabilities — the Lightrun Visual Studio Extension and Lightrun Dynamic Traces — are designed to elevate customers’ observability workflows and streamline their development processes directly within their IDE.
Sponsored Post

What Do DevOps Professionals Really Mean When They Talk About Kubernetes (K8s)?

In the world of DevOps, Kubernetes (K8s) is more than just a tool for managing containers-it's the backbone of modern infrastructure. When DevOps teams mention Kubernetes, they're referencing its vast capabilities, which extend far beyond basic container orchestration. They're talking about its ability to manage scaling, automation, networking, and security across complex, distributed systems. In this article, we'll explore what DevOps pros really mean when they discuss Kubernetes, highlighting the core features that make it a cornerstone of the DevOps ecosystem.

Observability to Generative AI: Journey in Evolving IT Operations

For those of us managing the ever-evolving IT infrastructure, the days of simple cause-and-effect relationships are long gone. A performance dip in one application might affect microservices, destabilizing the systems. Alerts – flood in, logs – pile up, and even the most sophisticated monitoring dashboards often leave asking: Where do we even begin?