Operations | Monitoring | ITSM | DevOps | Cloud

Reliable Backups in a Multi-Cloud World

Proper backups are universally acknowledged as essential, yet they grow increasingly tedious and prone to error as DevOps complexity escalates. While some managed database services offer automated backup solutions, the scope of your backup requirements is likely to expand as the business and products scale. There's a considerable chance you'll find yourself hosting your own databases or stateful services, a task that can seem daunting in its demand for precision and reliability.

From Reaction to Action: Accelerating Incident Response through Automation

In the Digital Age, IT incidents are an unavoidable aspect of business operations. From hardware failures to security breaches, these disruptions can wreak havoc on business continuity and user experience. Managing these incidents effectively requires a timely, systematic approach encompassing detection, prioritization, resolution, and communication. Traditional incident response methods often fall short, resulting in costly delays and inefficiencies.

"Secret" elmah.io features #3 - Automate tasks with elmah.io CLI

In this third post in the series of "secret" elmah.io features, I want to introduce you to the elmah.io Command Line Interface (CLI). While you probably spend most of your elmah.io-related time inside the UI, the CLI offers some interesting possibilities not available through the web app. In this post, I'll show what I believe are the two most usable and helpful features of the elmah.io CLI. There are a lot of sub-commands so feel free to play around with it.

Grafana 11 release: The latest in visualizations, Scenes-powered dashboards, simple access controls, and more

At the opening keynote of GrafanaCON 2024, attendees in Amsterdam got a sneak peek at some of the latest features in Grafana 11, which is now available in preview. Grafana: download now! For those of you who couldn’t score a ticket to the sold-out event, don’t worry — we have a roundup of all the latest updates to visualizations that make it easier than ever to create beautiful dashboards in Grafana.

Loki 3.0 release: Bloom filters, native OpenTelemetry support, and more!

Welcome to the next chapter of Grafana Loki! After five years of dedicated development, countless hours of refining, and the support of an incredible community, we are thrilled to announce that Grafana Loki 3.0 is now generally available. The journey from 2.0 to 3.0 saw a lot of impressive changes to Loki. Loki is now more performant, and it’s capable of handling larger scales — all while remaining true to its roots of efficiency and simplicity.

Find your logs data with Explore Logs: No LogQL required!

We are thrilled to announce the preview of Explore Logs, a new way to browse your logs without writing LogQL. In this post, we’ll cover why we built Explore Logs and we’ll dive deeper into some of its features, including at-a-glance breakdowns by label, detected fields, and our new pattern detection. At the end, we’ll tell you how you can try Explore Logs for yourself today. But let’s start from the beginning — with good old LogQL.

Introducing an OpenTelemetry Collector distribution with built-in Prometheus pipelines: Grafana Alloy

In the opening keynote of GrafanaCON 2024, we announced our newest OSS project: Grafana Alloy, our open source distribution of the OpenTelemetry Collector. Alloy is a telemetry collector that is 100% OTLP compatible and offers native pipelines for OpenTelemetry and Prometheus telemetry formats, supporting metrics, logs, traces, and profiles. Some of you may be thinking: Wait, another collector?

Where to automate resilience testing in your SDLC

When organizations begin to deploy resilience testing or Chaos Engineering, there’s a natural question: can we integrate this with our CI/CD pipeline or release automation tools? After all, you’re likely running unit, performance, and integration tests already—is resiliency different? The short answer is yes—to both. Integration is possible, but resiliency is different, so automation is a nuanced conversation.