Observability

The latest news and information on observability for complex systems and related technologies.

Developing a culture of observability

In the race to attract and retain customers, businesses must deliver great customer experiences, release reliable products fast, and scrutinize costs to achieve consistent growth. That effort can run like a well-oiled machine, or it can become a tangle of disjointed communications and workflows that frustrates customers, employees, and management alike. By developing a culture of observability, you gain a framework that harmonizes the experience for everyone.

Does OpenTelemetry in .NET Cause Performance Degradation?

Contrary to Betteridge’s Law of Tabloid Headlines, the answer to the question, "does OpenTelemetry in .NET cause performance degradation?" is yes, but context is important. I get this question so often that I thought it was time to get some stats on it. I’ve heard comments like: I can only assume that these are based on previous versions, or things like OpenTracing / OpenCensus (the heritage frameworks that were the feeders for OpenTelemetry).

Why the visibility gap is holding your IT operations back

Depending on your business, MTTR stands for mean time to repair or mean time to recovery – but it can also mean resolution, resolve, or restore. No matter how you define it, the basic measurement is the same: it’s the time it takes from when something goes down to when it is back and fully functional. This includes everything from finding the problem to fixing it. For ITOps teams, keeping MTTR to an absolute minimum is crucial.
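
As a rough illustration of that measurement, the sketch below averages the time from "went down" to "fully functional again" over a handful of incidents. The incident timestamps and the function name mean_time_to_recovery are hypothetical, made up for this example; real figures would come from your own incident tracker.

```python
from datetime import datetime, timedelta

# Hypothetical incident records (assumed data, not from any real system):
# each tuple is (time the service went down, time it was fully functional again).
incidents = [
    (datetime(2023, 5, 1, 9, 0), datetime(2023, 5, 1, 9, 45)),    # 45 minutes
    (datetime(2023, 5, 3, 14, 0), datetime(2023, 5, 3, 15, 30)),  # 90 minutes
    (datetime(2023, 5, 7, 22, 15), datetime(2023, 5, 7, 22, 30)), # 15 minutes
]


def mean_time_to_recovery(records):
    """Average the down-to-restored duration across all incidents."""
    if not records:
        raise ValueError("at least one incident is required")
    # Sum the individual outage durations, starting from a zero timedelta.
    total = sum((restored - down for down, restored in records), timedelta())
    return total / len(records)


mttr = mean_time_to_recovery(incidents)
print(mttr)
```

Because each duration runs from failure to full restoration, time spent finding the problem counts just as much as time spent fixing it.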

Enhancing cloud native application observability on AWS with business transaction insights

With business transaction insights in AppDynamics Cloud, you can turn cloud native chaos into business context. Here’s how. In any organization, technology plays a vital role in nearly every aspect of the business — from marketing to operations to human resources. But increasingly, its role in revenue generation is taking center stage. Profitability and growth are now in the hands of CTOs and CIOs.

Log Management in the Age of Observability

The explosive growth of interconnected data across distributed systems has disrupted traditional development, DevOps, and ITOps practices and forced many organizations to rethink their cloud strategies. Higher-velocity feature development and more responsive support requests involve developers throughout the delivery cycle and require them to monitor and observe application behavior before releasing it to production.

Expanding Our Vision: Unifying Client-Side Observability Data

In 2021, we started Request Metrics as a simple and developer-friendly service to measure and improve web performance. We built an incredible platform that distilled complex data down into simple reports and recommendations. Lots of teams around the world found valuable insights in Request Metrics that they couldn’t get anywhere else. But web performance data can be very unpredictable—the web slows down in all sorts of ways.

Cribl Reference Architecture Series: Scaling Effectively for a High Volume of Agents

Join Cribl’s Ed Bailey and Ahmed Kira in an insightful discussion about scaling your Cribl Stream architecture to accommodate a large number of agents. Managing high-volume agent data flows presents a unique set of challenges that must be addressed to ensure the reliable transmission of data from your endpoints to your analytics systems, meeting business resiliency requirements. Errors arising from agent scale and data volume can lead to difficult-to-diagnose and even more challenging-to-fix issues that tend to surface at the most inopportune times.
Sponsored Post

OpenTelemetry 101: A Non-Technical Guide to Starting Your Open Observability Journey

If you’re involved in IT Operations, you’ve probably heard of OpenTelemetry. It’s a hot topic in the observability industry, and for good reason. OpenTelemetry is a set of open-source tools and APIs that make it easy to collect telemetry data from your applications and infrastructure. This data can then be used to monitor your systems, troubleshoot problems, and improve performance.