Operations | Monitoring | ITSM | DevOps | Cloud

Blog

Grafana Cloud updates: new data visualization options, enhancements to Grafana Cloud k6, and more

We consistently roll out helpful updates and fun features in Grafana Cloud, our fully managed observability platform powered by the open source Grafana LGTM Stack (Loki for logs, Grafana for visualization, Tempo for traces, and Mimir for metrics). In case you missed it, here’s a roundup of the latest and greatest updates for Grafana Cloud this month. You can also read about all the features we add to Grafana Cloud in our What’s New in Grafana Cloud documentation.

Platform Engineering - Empowering Developers with Self-Service Tools

In the world of DevOps and cloud engineering, a new buzzword has emerged: Platform Engineering. This concept has sparked discussions across the industry, with professionals debating whether it replaces traditional DevOps or adds value to it. In reality, Platform engineering transforms how software teams work. It builds on DevOps principles to create self-service tools that boost developer productivity. This approach streamlines workflows, enhances security, and cuts operational costs.

Four key benefits of configuration templates in network automation

Configuration templates consist of small pieces of code that allow administrators to implement changes across numerous devices as many times as necessary. These templates, often referred to as configlets, expedite system setup and make it more resistant to errors.

Strategies that foolproof your AWS disaster recovery strategies

AWS disaster recovery strategies In today's interconnected world, business continuity is no longer a luxury but a necessity. Disasters, both natural and man-made, can cripple operations, leading to significant financial losses and reputational damage. To mitigate these risks, organizations are increasingly turning to cloud-based solutions, with Amazon Web Services (AWS) emerging as a preferred platform for disaster recovery (DR) strategies.

Enhance your GenAI application monitoring with Crest Data's Datadog Marketplace integrations

As organizations begin developing generative artificial intelligence (GenAI) applications, observability challenges could hinder their progress. Few robust monitoring tools for GenAI applications are available, which makes identifying and resolving issues in these applications time-consuming and error-prone.

What is Fleet Management in Telemetry?

Fleet management is a derivative term. Originally used in the automotive industry, it’s now used in a span of domains. It’s being used in data telemetry since the introduction of OpAmp, which is a part of the Open Telemetry project. Now, fleet management has broader implications. It simplifies telemetry data collection by automating agent deployment, and configuration, and providing insights into the real-time health and performance of your sprawling agent infrastructure.

SD-WAN: Dead or Different?

The rapid evolution of work models and security requirements has prompted questions about the relevance of Software-Defined Wide Area Network (SD-WAN) technology. In their insightful report, ‘Is SD-WAN Dead?’ Jonathan Forest and Andrew Lerner of Gartner explore these dynamics, concluding that while SD-WAN is far from obsolete, its role is shifting.

Fundamentals of a Successful Logging and Observability Strategy

Your team is responsible for ensuring the reliability and performance of your organization’s critical applications and infrastructure. What keeps you up at night? Your applications are more complex, distributed and cloud-native than ever, meaning that understanding what’s happening under the hood has never been more complex than it is now. Is it system bugs, or data bottlenecks? Chasing alerts for latency or service degradation that may or may not be business-critical?