
The latest News and Information on DevOps, CI/CD, Automation and related technologies.

FinOps Is The Margin Lever SaaS CEOs Keep Ignoring

You’re probably not combing through cloud bills. That’s not your job as CEO. But if no one on your executive team can tell you what it costs to serve a customer, ship a feature, or launch a new product line, that’s a problem. Not a someday problem. A right-now, quietly-draining-your-margins kind of problem. FinOps tends to get lumped in with cost-cutting — some finance thing, some DevOps thing. But that framing misses the point. Done right, FinOps is a growth enabler.

Top SaaS Companies Defining The Future Of SaaS

Picture this. Gartner forecasts worldwide end-user spending for public cloud usage to total more than $720 billion in 2025 — up from $595 billion in 2024. Out of that spend, SaaS will make up a chunky $299 billion. For comparison, Infrastructure-as-a-Service (IaaS) and Platform-as-a-Service (PaaS) will make up nearly $212 billion and $209 billion, respectively. Elsewhere, BetterCloud’s State of SaaS 2025 report found that the average organization uses 106 different SaaS tools.

GoodRx Releases Lifecycle Solution for Ephemeral Developer Environments with Built-in Support for Codefresh Pipelines

GoodRx, a digital healthcare platform, has released the Lifecycle project as open-source code. Lifecycle is a complete solution for temporary/ephemeral environments. The project’s build process includes built-in support for Codefresh pipelines.

DORA Compliance: How Upsun supports our financial services customers

The Digital Operational Resilience Act (DORA) is reshaping how financial institutions in the EU manage and contract with their technology providers. Since January 17, 2025, DORA has required financial entities to meet stricter rules for managing digital risks, especially when it comes to the third-party ICT (Information and Communication Technology) service providers they rely on.

6 OpsGenie Alternatives for On-Call Management

You’re likely here because you heard the news: Atlassian ended new sales for OpsGenie on June 4, 2025, with a complete shutdown scheduled for April 2027. For years, OpsGenie has been the backbone of on-call management for countless teams. It might have been your team’s trusted solution too. But now, that chapter is closing. The pressure to find an OpsGenie alternative for on-call is real. However, you can’t just pick any tool and hope it works for your team.

Improve Consistency Across Signals with OTel Semantic Conventions

It’s 2 AM. Your API is timing out. Logs show a slow query. Metrics flag a spike in DB connections. Traces reveal a 5-second delay on a database call. But then the questions start:
- Which database?
- Does the query match the delay?
- Why doesn’t this align with the connection pool metrics?

Each tool uses different labels: db.name, database, sometimes nothing at all. Without a shared schema, connecting the dots is slow and frustrating.
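To make the label mismatch concrete, here is a minimal Python sketch that maps tool-specific label names onto one shared key so that a log record, a metric, and a span can be joined on the same database. The alias table, the sample records, and the choice of `db.name` as the shared key are assumptions for illustration only; check the version of the OTel semantic conventions you target for the attribute names it actually defines.

```python
# Hypothetical alias table: tool-specific label names mapped to one shared key.
# "db.name" is used as the shared key only because the teaser mentions it.
ALIASES = {
    "database": "db.name",
    "db_name": "db.name",
    "db.name": "db.name",
}


def normalize(attrs):
    """Return a copy of attrs with known aliases renamed to the shared key."""
    return {ALIASES.get(key, key): value for key, value in attrs.items()}


# Made-up records from three signals, each labelling the database differently.
log_record = {"database": "orders", "msg": "slow query"}
metric_labels = {"db_name": "orders", "pool": "primary"}
span_attrs = {"db.name": "orders", "duration_ms": 5000}

signals = [normalize(s) for s in (log_record, metric_labels, span_attrs)]

# After normalization all three signals can be correlated on one key.
same_db = {s["db.name"] for s in signals}
print(same_db)
```

With a shared key in place, "which database?" becomes a single lookup instead of a per-tool translation exercise.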

How Replicas Work in Kubernetes

Replicas in Kubernetes control how many copies of your pods run simultaneously. They're the foundation of scaling, availability, and recovery in your cluster. Whether you're running a stateless API or a background worker, understanding how replicas work directly impacts your application's reliability and performance. This blog walks through replica management, from basic concepts to production monitoring patterns that help you maintain healthy, scalable applications.
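The core idea behind replicas is a comparison between a desired count and what is actually running. The sketch below models that comparison in plain Python. It is a simplified illustration, not the Kubernetes controller code: the `reconcile` function and the pod names are hypothetical, and the real controller applies its own rules when choosing which pods to remove.

```python
from typing import Dict, List


def reconcile(desired: int, running: List[str]) -> Dict[str, object]:
    """Return the actions needed to move the running pods toward the desired count."""
    diff = desired - len(running)
    if diff > 0:
        # Too few pods: ask for the missing number of new ones.
        return {"create": diff, "delete": []}
    if diff < 0:
        # Too many pods: this sketch simply picks the first surplus pods.
        return {"create": 0, "delete": running[:-diff]}
    # Already at the desired count: nothing to do.
    return {"create": 0, "delete": []}


pods = ["api-1", "api-2"]
print(reconcile(3, pods))  # one pod short
print(reconcile(1, pods))  # one pod too many
print(reconcile(2, pods))  # steady state
```

The same comparison explains recovery: when a pod disappears, the running count drops below the desired count and a replacement is requested.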

See System Logs Alongside your Metrics Using Loki, Grafana, and Graphite

In this quick demo, we show how you can transform logs collected by Grafana Loki into actionable Graphite metrics using MetricFire. Watch as we convert structured logs into performance insights. Perfect for teams looking to bridge the gap between logging and monitoring. This workflow helps you move beyond basic log storage and turn raw logs into meaningful metrics for alerts, dashboards, and capacity planning.
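As a rough illustration of turning structured logs into metrics, the Python sketch below counts JSON log lines per level and formats the counts in Graphite's plaintext format (`path value timestamp`). It is not the workflow shown in the demo: the `logs_to_graphite` function, the `level` field, the `app.logs` prefix, and the sample lines are all assumptions made for the example.

```python
import json
from collections import Counter


def logs_to_graphite(lines, prefix, timestamp):
    """Count structured log lines per level and format them as Graphite plaintext metrics."""
    counts = Counter()
    for line in lines:
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip lines that are not valid JSON
        if not isinstance(record, dict):
            continue  # skip JSON values that are not objects
        level = str(record.get("level", "unknown")).lower()
        counts[level] += 1
    # One "path value timestamp" line per level, sorted for stable output.
    return [
        f"{prefix}.{level}.count {n} {timestamp}"
        for level, n in sorted(counts.items())
    ]


sample = [
    '{"level": "error", "msg": "timeout"}',
    '{"level": "info", "msg": "started"}',
    "not json",
    '{"level": "ERROR", "msg": "timeout"}',
]
metrics = logs_to_graphite(sample, "app.logs", 1700000000)
print("\n".join(metrics))
```

Each output line can then feed alerts or dashboards in the same way as any other Graphite metric.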

Why we're talking to people about reliability

Reliability means a lot of things to a lot of people, but it’s also essential for every digital business. That’s why we’re talking to reliability experts from all over to find out what reliability means to them and how you can improve it.

Transcript: You know, we're all out here building and operating digital businesses and like nobody's talking about reliability enough. We gotta talk about it. I can't stop talking about it and I've been on call for like 20 years.