The latest news and information on observability for complex systems and related technologies.

Observability: It's the User Experience, Stupid!

Observability, a concept that originated in control theory, measures how well you can understand a system’s internal states from its external outputs. Observability uses instrumentation to provide insights that aid monitoring. In DevOps, observability is achieved through a set of monitoring solutions. The shift to using one vendor platform rather than multiple solutions makes sense as…

Cloud Observability 101: Start and End with Performance

Join network observability gurus Anil Murty and Dan Rohan for a real-world deep dive into the common cloud performance pitfalls, and how to avoid them. You’re adopting cloud in a big way, but your observability hasn’t kept up. Whether you’re responsible for your corporate network or revenue-producing service, you can’t afford performance blind spots.

ServiceNow acquires next-gen observability leader Lightstep

I’m excited to announce that ServiceNow has signed an agreement to acquire next-generation observability leader Lightstep. Combining Lightstep’s innovative observability capabilities with ServiceNow’s unmatched Now Platform will help customers better manage software complexity, reliability, and performance while enabling the enterprise workflows that deliver great experiences.

How Cool? Very Cool! Lightrun named a Cool Vendor by Gartner in Monitoring, Observability, and Cloud Operations

We are thrilled to announce that Lightrun — the world’s first dev-native continuous observability and debugging platform — has been recognized by Gartner as a Cool Vendor, based on its April 28 report titled, “Cool Vendors in Monitoring, Observability and Cloud Operations” by Padraig Byrne, Pankaj Prasad, Hassan Ennaciri, Venkat Rayapudi, and Gregg Siegfried. “Lightrun helps reduce mean time to repair (MTTR) by enabling continuous debugging capabilities.

Adding free and open Elastic APM as part of your Elastic Observability deployment

In a recent post we showed you how to get started with the free and open tier of Elastic Observability. Today we'll walk through what you need to do to expand your deployment so you can start gathering application performance monitoring (APM) metrics, or "tracing" data, in your observability cluster for free.

Dynamic Service Graph | Tigera - Long

Downtime is expensive and applications are a challenge to troubleshoot across a dynamic, distributed environment consisting of Kubernetes clusters. While development teams and service owners typically understand the microservices they are deploying, it’s often difficult to get a complete, shared view of dependencies and how all the services are communicating with each other across a cluster. Limited observability makes it extremely difficult to troubleshoot end-to-end connectivity issues which can impact application deployment.

Application Layer Observability | Tigera - Long

The majority of operational problems inherent to deploying microservices in a distributed architecture are linked to two areas: networking and observability. At the application layer (Layer 7), the need to understand all aspects associated with service-to-service communication within the cluster becomes paramount. Service-to-service network traffic at this layer often uses HTTP. DevOps teams struggle with these questions: Where is monitoring needed? How can I understand the impact of issues and effectively troubleshoot? And how can I effectively protect application-layer data?

Splunk Observability Cloud: Cutting through the complexity of modern applications

As infrastructure modernizes, it becomes more complex and more difficult to monitor and operate. To truly understand what your systems are doing, you need full-stack, end-to-end observability. We built Splunk Observability Cloud to eliminate your blind spots and go from alert to problem resolution in seconds, not hours. Splunk Observability Cloud provides one unified experience for seamless monitoring, troubleshooting, and resolution across any stack, at any scale.