Operations | Monitoring | ITSM | DevOps | Cloud

Observability

The latest News and Information on Observabilty for complex systems and related technologies.

Destroy on Friday: The Big Day A Chaos Engineering Experiment - Part 2

In my last blog post, I explained why we decided to destroy one third of our infrastructure in production just to see what would happen. This is part two, where I go over the big day. How did our chaos engineering experiment go? Find out below!

Streamlining Debugging with Lightrun Snapshots: A Superior Alternative to Trace Logging

According to a recent study, failing tests alone cost the enterprise software market an astonishing $61 billion annually. This figure mirrors the vast number of resources devoted to rectifying software failures, translating into about 620 million developer hours lost each year. On average, engineers spend 13 hours to resolve a single software failure, a statistic that paints a stark picture of the current state of debugging efficiency.
Featured Post

AI-enabled observability solutions are essential to manage application performance and security in on-premises environments

For all of the focus given to cloud-native technologies over recent years, it's sometimes easy to forget that a huge number of organizations continue to run their business critical and applications on-premises. And this will undoubtedly be the situation for some years to come within the public sector and in industries such as financial services and healthcare where organizations need to adhere to strict data privacy and security rules.

Enlightning - Instant Observability: Exploring the Magic of Pixie

Pixie is an open source observability tool for Kubernetes applications. Pixie automatically collects telemetry data, including full-body requests, resource and network metrics, application profiles, and more. Using Pixie, developers can view the high-level state of their cluster (service maps, cluster resources, application traffic) and also drill down into more detailed views (pod state, flame graphs) without having to modify or redeploy their code.

OpenTelemetry, AI, and the Future of Observability with Andreas Grabner

Shubham Srivastava from our team had the pleasure of meeting Andreas Grabner at KubeCon + CloudNativeCon Europe earlier this year. Andreas wears many hats in his daily work, primarily serving as a DevOps Activist at Dynatrace, where he has dedicated over 16 years to shape the Observability solutions we see today. He is also a Developer Advocate at Keptn – helping teams automate and orchestrate their deployments end-to-end and plays an active role as an Ambassador in the CNCF community.

Cribl's Blueprint for Secure Software Development.

What does it take to build software for the most security-demanding customers worldwide? At Cribl, building secure products is integral to our engineering identity. We have established a secure software development lifecycle that is both culturally and policy-driven, integrating product security tooling and processes into every architecture review, pull request, and release, whether major or minor.

What Makes for a 'Good' Pair Programming Session?

Software changes so rapidly that developing on the cutting edge of it cannot fall to a single person. When it comes to asynchronously disseminating information about projects, code comments, PR conversations, Slack, RFCs, and other investigatory documents do a wonderful job, but no amount of async communication replaces the magic of two brains bouncing ideas off of each other.

Unleashing the Power of Hybrid Cloud - Introducing Hybrid Observability in HPE GreenLake Flex Solutions

In today's fast-paced digital economy, businesses are constantly seeking innovative solutions to streamline their operations, enhance agility, and drive growth. As enterprise IT infrastructure environments get more distributed and complicated to meet evolving demands, the need for robust IT monitoring, management and automation becomes even more important.