Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

New Custom Dashboards: Metrics, Logs, Live Commands, and More in a Single View

Custom dashboards in Netdata have always let you pull charts together on-the-fly into a single view. That’s useful, but it’s also limited. In practice, when you’re running an incident or reviewing a service, you don’t just want charts. You want to see the output of top alongside your CPU metrics. You want slow query logs next to your database latency charts.

UK sovereign cloud security standards to watch in 2026

The regulatory landscape governing UK sovereign cloud security has shifted more dramatically in the past 12 months than in the preceding decade. New legislation, tightened procurement frameworks, and an intensifying cyber threat environment are collectively raising the compliance floor for organizations running cloud workloads in the UK.

Getting more out of Playwright CLI: a practical guide for QA and DevOps teams

If your team runs Playwright tests in CI, you already know the npx playwright test drill. It works fine until your suite crosses a few hundred tests. Then things get messy. Flaky reruns stack up. Debugging means downloading trace zip files and opening them on your laptop. Reports? Static HTML files that people stop checking after day 3.

What Is Snowflake? A Beginner-Friendly Guide

Imagine if you had a magic box where you could keep all your business information — sales numbers, customer feedback, everything — safe and sound, but also easy to look at whenever you needed. That’s kind of what Snowflake does, but for big organizations and using the cloud. It’s a new way for companies to store and use their data without getting bogged down by the techy details.

Manage Hyperping with Terraform: Community Provider by Develeap

If you manage more than a handful of monitors, you have probably wanted to define them in code rather than clicking through a dashboard. Terraform is the standard tool for that in the infrastructure world, and now there is a Terraform provider for Hyperping. Develeap, a DevOps consultancy, built this provider while managing monitoring for 57 tenants at scale. They needed infrastructure as code for monitors, status pages, and incidents, so they built it, tested it in production, and open-sourced it.

How PayPal hyperscaled Kubernetes routing with HAProxy Fusion

PayPal runs six data centers, each with around 60,000 containers. Their 30,000 employees spin up nearly 10,000 test environments every day — roughly 6 to 10 every minute. Each environment requires three config updates: one to create the virtual service, and additional calls to configure and deploy the applications. Do the math and you get a staggering 30,000 config updates per day.

Why DR Testing Can No Longer Be an Afterthought | Harness Blog

Regular DR testing is no longer a compliance checkbox — it is a critical engineering discipline that determines whether an organisation can survive a real cloud outage with its services and revenue intact. As the AWS Middle East incident demonstrated, regional cloud failures can strike without warning and defeat standard redundancy models, making untested DR plans dangerously unreliable.

From One Month to One Day: How CloudZero Builds Cloud Cost Connectors at the Speed of AI Adoption

Not long ago, adding a new cost connector to CloudZero was a serious undertaking. We’d task multiple engineers, build in extended review cycles, run a private preview period. But a single connector could take up to two months from kickoff to customer hands. For the major cloud providers, that timeline was acceptable. The size of the investment matched the scale of the integration. But the tools landscape has changed. Our customers’ teams don’t just run on AWS and Azure.