Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Start with Traces, not with Logs: How Honeycomb Helped Massdriver Reduce Alert Fatigue

Massdriver is a cloud operations platform that makes it easier for engineering teams to build, deploy, and scale cloud-native applications. While many companies use this lofty language to make similar promises, Dave Williams, CTO and co-founder at Massdriver, means it. Before Massdriver, Dave worked in product engineering where he was constantly bogged down with DevOps toil. He spent his time doing everything except what he was hired to do: write software.

Live Debugging for Critical Systems

Live debugging refers to debugging software while running in production without causing any downtime. It has gained popularity in modern software development practices, which drives many critical systems across businesses and industries. In the context of always-on, cloud-native applications, unearthing severe bugs and fixing them in real time is only possible through live debugging. Therefore, live debugging becomes an integral part of any developer’s skill set.

How to deploy a Hello World web app with Elastic Observability on Azure Container Apps

Elastic Observability is the optimal tool to provide visibility into your running web apps. Microsoft Azure Container Apps is a fully managed environment that enables you to run containerized applications on a serverless platform so that your applications scale up and down. This allows you to accomplish the dual objective of serving every customer’s need for availability while meeting your needs to do so as efficiently as possible.

Why Real-Time Debugging Becomes Essential in Platform Engineering

Platform engineering has been one of the hottest keywords in the software community in recent years. As a natural extension of DevOps and the shift-left mentality it fosters, platform engineering is a subfield within software engineering that focuses on building and maintaining tools, workflows, and frameworks that allow developers to build and test their applications efficiently.

Troubleshooting Cloud Native Applications at Runtime

Organizations are moving to micro-services and container-based architectures because these modern environments enable speed, efficiency, availability, and the power to innovate and scale more quickly. However, when it comes to troubleshooting distributed cloud native applications, teams face a unique set of challenges due to the dynamic and decentralized nature of these systems.

Your Guide to Prometheus Observability

Imagine you’re piloting a spaceship through the cosmos, embarking on a thrilling journey to explore the far reaches of the universe. As the captain of this ship, you need a dashboard that displays critical information about your vessel, such as fuel levels, navigation data, and life support systems. This dashboard is your lifeline, providing you with real-time insights about the health and performance of various systems within your ship, so you can quickly make critical decisions.

Do you need better cloud observability - or AI-powered cloud visibility?

Maybe you’re still using monolithic applications, built and refined over many years. You understand that shifting to microservices or containerized architectures is a huge and daunting task. You’re probably grappling with the limitations of legacy systems—maybe they’re slow, tough to update, or can’t scale as you’d like. And you’re likely using more traditional IT monitoring tools or even some cloud observability tools.

Breaking Through the Observability Wall: Scaling Your Telemetry Architecture

In today's digital landscape, Observability and telemetry data play a crucial role in ensuring the performance, reliability, and security of modern applications and services. However, as data volumes explode due to the proliferation of micro-services, cloud-based applications, and connected devices, existing architectures are hitting a scalability wall.

Unlocking Observability - Dive into OpenTelemetry's Top Use Cases

OpenTelemetry can be used for generating and collecting telemetry signals like logs, metrics, and traces. The advantage of using OpenTelemetry for observability is that it is open-source and frees you from vendor lock-in. You can use OpenTelemetry for multiple use cases OpenTelemetry is an open-source project which has emerged as the standard for achieving comprehensive observability in modern applications.