Operations | Monitoring | ITSM | DevOps | Cloud

Putting Cloud Cost Optimization On Autopilot

When I co-founded CloudZero, I set out to change the way the world thinks about cloud cost optimization and management. For too long, I had watched cloud cost optimization vendors neglect to help companies build better, more cost-effective software. Instead, everything focused on buying better, by optimizing prices and discounts.

Reducing False Positives in Capped Campaigns

As the adtech industry continues to expand and the volume of ads sold and served grows exponentially, the only way to manage the business is through programmatic advertising. This approach utilizes data insights and algorithms to automatically serve ads to the right user, at the right time, on the right platform, and at the right price. The speed and scale of online advertising means that adtech companies need to collect, analyze, and act upon immense datasets instantaneously, 24 hours a day.

GitOps your WordPress with ArgoCD, Crossplane, and Shipa

WordPress is a popular platform for editing and publishing content for the web. This tutorial will walk you through how to build out a WordPress deployment using Kubernetes, ArgoCD, Crossplane, and Shipa. WordPress consists of two major components: the WordPress PHP server and a database to store user information, posts, and site data. We will define these two components and store them in a Git repository.

CLI Installer

While new cloud native architectures are incredibly feature-rich, they can come with a high barrier to entry. Many getting started tutorials are pages long and can take forever to complete. But these always start with the first step of performing an installation. In the spirit of making the installation of Speedscale as simple as possible, we have designed a new interactive installer as part of the speedctl command line interface.

The new Check Overview is now live!

Today, I'm excited to share the release of a long-planned and requested feature - our new Check Overview Page. Until now, Checkly enabled you to troubleshoot single alerts, but a deep dive into the long-term performance trends was limited. That is not the case anymore. In the new Check Overview, we’re introducing the enhanced analytics in four distinct categories: The update is focused on two important outcomes.

Set up and observe a Spring Boot application with Grafana Cloud, Prometheus, and OpenTelemetry

Spring Boot is a very popular microservice framework that significantly simplifies web application development by providing Java developers with a platform to get started with an auto-configurable, production-grade Spring application. In this blog, we will walk through detailed steps on how you can observe a Spring Boot application, by instrumenting it with Prometheus and OpenTelementry and by collecting and correlating logs, metrics, and traces from the application in Grafana Cloud.

Using AI & ML for Application Performance (APM)

Today, IT and site reliability engineering (SRE) teams face pressure to remediate problems faster than ever, within environments that are larger than ever, while contending with architectures that are more complex than ever. In the face of these challenges, artificial intelligence has become a must-have feature for managing complex application performance or availability problems at scale.

Cloud Log Management Strategy & Best Practices

For IT Operations and Site Reliability Engineering (SRE) teams, logging is nothing new. In fact, collecting and analyzing logs is one of the oldest cornerstones of performance management. Logs have been part and parcel of APM workflows for decades. Yet the logging strategies that worked in eras past often fall short today. That’s thanks to the advent of cloud-native computing, which has ushered in fundamental new challenges in the way teams aggregate, analyze, and manage logs.