
Observability

The latest News and Information on Observability for complex systems and related technologies.

Contextual Intelligence and Observability: Without the Former, You Really Don't Have the Latter

Observability is a hot term in the industry, but don’t let it fool you: having visibility into your organization's apps and services only gives you partial clarity into a system’s overall performance. To get a full understanding of your monitoring data, you need to apply contextual intelligence.

Quick Dictionary to Open<X> Projects in Observability

Do you also find yourself confused by all the Open-this and Open-that names flying around? There are currently a good few Open projects, standards and tools – OpenTelemetry, OpenTracing, OpenCensus, OpenSearch… heck, even my podcast is called OpenObservability! And new Open names seem to be popping up every other day. If you feel this way too, you are not alone: many feel similarly confused.

Model-driven observability: modern monitoring with Juju

The end-to-end monitoring of complex software systems is difficult, toil-intensive and error-prone. Developers, SREs and Platform teams must continuously invest effort in setting up and maintaining the monitoring setups that underpin the observability of their systems, or accept the risk of being unaware of ongoing issues and their impact on end users. Enter model-driven observability powered by Juju!

Remote Debugging & Device Observability: How Memfault & Diamond Kinetics fix firmware bugs together

With their original products in the market, Diamond Kinetics relied on customer reports with little detail to fix issues in the field. They soon realized that this process of “psychic debugging” sensors in the wild was not enough and even delayed release dates as they tried perfecting each device before shipment. When they decided to launch a new generation of their product in 2020, they knew they needed a more efficient process.

Tale of the Beagle (Or It Doesn't Scale-Except When It Does)

If there’s one thing folks working in internet services love saying, it’s: "Yeah, sure, but that won’t scale." It’s an easy complaint to make, but in this post, we’ll walk through building a service using an approach that doesn’t scale in order to learn more about the problem. (And in the process, we’ll discover that it actually did scale much longer than one would expect.)

PD Summit21: Transforming Infrastructure Teams Through Observability

What is this "observability" thing that everyone is talking about? Observability allows you to navigate the dark unknowns with echolocation while others attempt to fly blindly without it. Are your dashboards all green, but you still have an issue brewing? Do you need instant feedback based on the Core Analysis loop? Are your engineers tired of waking up at 3 AM for the expected issues? Is there a lack of time for experimentation? Generate your own answers and create a meaningful course of action with observability.

Detect any issue with Splunk APM before it turns into a customer problem

With 100% of spans and traces captured, Splunk APM meets any necessary business KPIs and SLO metrics while you investigate and troubleshoot transaction errors related to a backend application. Easily construct error budgets that measure the performance of services today. Learn how with this free trial of Splunk Observability Cloud.

PD Summit21: MUX: Video Observability: Operational Alerting for Responding to Issues In Real-time

Streaming video accounts for the majority of internet traffic, and your applications and infrastructure almost certainly include video. Mux Data allows you to easily monitor the real-time quality of experience delivered to your video viewers, and by integrating with PagerDuty you can automate a response and reduce the time to resolution when something goes wrong. We will cover the basics of video monitoring and how integrating with PagerDuty can ensure a great experience for viewers.

How Vanguard used Observability to Accelerate and De-risk their Cloud Migration

Rich Anakor, chief solutions architect at Vanguard, is on a small team with a big goal: Give Vanguard customers a better experience by enabling internal engineering teams to better understand their massively complex production environment—and to do that quickly across the entire organization, in the notoriously slow-moving financial services industry. They also had a big problem: The production environment itself.