The latest news and information on observability for complex systems and related technologies.

Tame Your Telemetry: Introducing the Honeycomb Telemetry Pipeline

Observability means you know what’s happening in your software systems, because they tell you. They tell you with telemetry: data emitted just for the people developing and operating the software. You already have telemetry: every log is a data point about something that happened. Structured logs or trace spans are even better, containing many pieces of data correlated in the same record. Still, you want to start from what you have, then improve it as you improve the software.

The Path to Autonomous Observability

Autonomous observability for system monitoring and management aims to use GenAI and machine learning to automatically detect, diagnose and resolve issues. In conversations about cloud observability today, discussions often shift from “what’s possible” to “what’s practical.” Too often, these conversations highlight the shortcomings of current observability processes, tools and financial models.

How to scale observability for AWS hybrid and multi-cloud environments

Managing observability across hybrid and multi-cloud environments is like flying a fleet of planes, each with different routes, altitudes, and destinations. You’re not just piloting a single aircraft; you’re coordinating across multiple clouds, on-premises systems, and services while ensuring performance, availability, and cost-efficiency. AWS customers, in particular, face challenges with workloads spanning multiple regions, data centers, and cloud providers.

Understanding Jaeger - From Basics to Advanced Distributed Tracing

Jaeger has emerged as a crucial tool in the modern distributed systems landscape, offering powerful tracing capabilities that help organizations understand and optimize their microservices architectures. This comprehensive guide explores everything from basic concepts to advanced implementations, providing you with the knowledge needed to effectively implement and utilize Jaeger in your environment.

Comprehensive Observability: Key User Experience Metrics to Monitor in Cloud Environments

As we conclude our three-part series on the key observability metrics that ScienceLogic monitors, this post analyzes user experience (UX) metrics and their business impact. Whether it’s an internal business application or a customer-facing platform, a seamless and efficient user experience can significantly affect satisfaction, productivity, and loyalty.

How observability, AI and automation are leading the workload management evolution

Workload management is ubiquitous when it comes to automating critical business processes. With time, workload management as a technology is going through a gradual evolution, from ‘just automation’ to an orchestrator of intelligent automation. This necessitates a layer of observability and intelligence to facilitate the move from workload automation to workload management.

Determining a CoPE's Efficacy - and Everything After

As discussed in the first article in this series, a Center of Production Excellence (CoPE) is a more or less formal, provisional subsystem within an organization. Its purpose is to act from within to change that organization so that it’s more capable of achieving production excellence. The series has, to date, focused mainly on how best to construct such a subsystem and what activities it should pursue.

Reduce Observability Costs with OpenTelemetry Setup

Maintaining and visualizing telemetry data efficiently is super important for DevOps and SecOps teams. OpenTelemetry, a fantastic open-source observability framework, can really help with this without being too costly. Picture having a simple process that improves your data and helps your team make smart decisions without spending too much money. Let's chat about some budget-friendly ways to set up OpenTelemetry agents.