Operations | Monitoring | ITSM | DevOps | Cloud

Trying and failing and trying again

Starting software products is hard, and it’s easy to make mistakes. We’ve started a lot of products – and we’ve made a whole lot of mistakes along the way. But that’s not going to stop us. We’re stubborn like that. Today we are launching Request Metrics for the third time, and I’m reflecting on what we did wrong in the first two attempts, and how we’re going to be better, faster, and strong next time.

Accelerate TraceQL queries at scale with dedicated attribute columns in Grafana Tempo

With Grafana Tempo 2.3, we introduced a new storage format (vParquet3), which enabled an exciting new feature (dedicated attribute columns) that focused on the read path. Dedicated attribute columns offer a wide range of benefits primarily centered around query performance and memory usage. These columns can improve read speed across most queries, and they can have a major impact on resource utilization.

Safeguarding Operations: A Comprehensive Guide to Disaster Recovery and Business Continuity for Data Center Managers

In the dynamic world of data center operations, preparedness is key. This blog serves as a comprehensive guide for data center operations managers, exploring the critical aspects of disaster recovery (DR) and business continuity (BC) planning. Learn how to fortify your data center against unforeseen events and ensure seamless operations even in the face of adversity.

Mastering Remote Management and Monitoring: A Guide for Data Center Operations Managers

In the fast-paced world of data center operations, the landscape is constantly evolving, and with the rise of remote work, the challenges and opportunities for operations managers have reached new heights. In this blog, we’ll explore the ins and outs of remote management and monitoring, providing insights and strategies to help data center operations managers navigate this dynamic terrain seamlessly.

Navigating Challenges with Precision: A Guide to Remote Incident Response for Data Center Operations Managers

In the era of distributed workforces, the need for effective remote incident response is more critical than ever. This blog serves as a comprehensive guide for data center operations managers, offering insights and strategies to navigate incidents with precision and efficiency, regardless of the geographical location.

Scale Your Splunk Cloud Operations With The Splunk Content Manager App

Effectively managing both public and private Splunk Apps across multiple Splunk environments poses a considerable challenge, demanding significant time and effort with the potential for tedious and manual tasks. Recognizing this complexity, the Splunk Cloud Service has been progressively introducing additional features and capabilities to streamline and simplify these intricate administrative responsibilities.

Streamlining Cloud Costs With Smart Management Strategies

Cost optimization within cloud services is not just about cutting services; it’s about investing resources wisely to achieve greater efficiency and growth. Amazon Web Services (AWS) continues to be a leader in providing solutions that help businesses manage and optimize their cloud spending. This guide aims to guide you through the complex world of AWS cost management, highlighting key indicators and tools essential for keeping your cloud expenses in check.

What To Do When A Customer (Or Segment) Is Costing Your SaaS Business Too Much

You’re a responsible SaaS company leader, so you understand the importance of tracking your cloud costs in detail. Perhaps you’ve even begun working with us at CloudZero, and you’re starting to see data and insights hit your dashboard. If so, you may have noticed — because this happens to all of us in the SaaS world at some point — that some customers cost your business far more than others. Suppose you’re also tracking your revenue per customer.

Elastic recognized with 2024 EMA Allstars award for its AI-assisted observability

We are thrilled to be recognized with the 2024 EMA Allstars award. This award acknowledges Elastic’s focus on delivering a full-stack observability solution that provides unified visibility and AI-powered insights into complex hybrid cloud deployments. The EMA Allstars award celebrates trailblazers and innovators who are reshaping the enterprise technology landscape.

Grafana Unleashes Official InfluxDB V3 Data Source: A Quick-start Guide to Configuration and Usage

Yes, the title says it all: Grafana released the official V3 plugin for InfluxDB Data Source! Before delving into the tutorial, we’d like to thank Ismail Simsek, a Tech Lead at Grafana. Ismail was pivotal in adding the V3 SQL plugin to the InfluxDB data source and making significant backend code improvements. To clarify, this release isn’t an entirely new data source.