Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

GitKraken Desktop in 6 Minutes: Open a Repo, Run an Agent, Ship the Change

The fastest way to get up and running in GitKraken Desktop. In this tutorial, you'll open a repo, start an AI coding agent in its own worktree, review the agent's changes against your own work, and ship a pull request without leaving the app. What you'll learn: Chapters Help Center: help.gitkraken.com.

Ask Cortex anything, right from Slack

The Monday morning thread. Someone asks who owns checkout-service. Someone else asks what changed in the Production Readiness Scorecard last week. A third person wants to know if the Kubernetes migration is blocking the launch next Thursday. The answers exist. They live in Cortex. But getting them into the thread means someone stops what they're doing, opens a tab, finds the data, and pastes it back. By the time they do, the conversation has moved on.

Misconfigured Alert Detection: Find the Alerts That Need Tuning

Netdata ships with hundreds of stock alerts. They cover a wide range of infrastructure conditions and they’re designed with sensible defaults. But “sensible defaults” and “correct for your environment” are not the same thing. A CPU threshold that’s perfectly reasonable for a build server might generate constant noise on a machine running batch jobs.

Who's on call? How Claude helped us calculate this 2,500x faster

Schedules are a core part of any on-call system. In ours, they define who to page and when. But people use them in lots of other ways too: checking their next shift, asking for cover while at the gym, keeping a Slack user group up to date, or updating a Linear triage responsibility. For many of our customers, they’re one of the main ways they interact with our product, and as they’re such a foundational part of On-call, it’s very important they work well.

Stop watching the looms: why the AI era belongs to infrastructure

I live in Manchester, England now. I moved here from Texas last summer (which is its own story), but the thing I wasn't prepared for is how the Industrial Revolution isn't history here. It's the city itself. And if you're American like me, you might need to hear this: the Industrial Revolution didn't start in the US. It started here. Manchester is where the modern world was born. You see it everywhere. The old cotton mills converted into apartments.

Poland's KSC Act Is Now in Force: Why NIS2 Compliance Starts with Infrastructure Automation

Poland’s implementation of the EU’s NIS2 Directive marks a decisive shift in how organisations think about cybersecurity, resilience, and operational risk. With amendments to the Act on the National Cybersecurity System (KSC Act) entering into force on 3 April 2026, enforcement expectations are now real, national, and significantly stricter than many organisations anticipated – including obligations for security controls, incident response, and supply‑chain governance.

Disaster Recovery Testing in Harness | Resilience Testing

In this video, we introduce Harness Resilience Testing and show you how to move beyond once-a-year DR drills to a continuously validated, pipeline-driven process. You'll see how Harness lets you validate regional failovers, check database replication lag under pressure, and confirm your hot standbys genuinely take over live traffic, all in one place. We also walk through a live DR test execution, showing exactly how Harness triggers the full failover sequence, runs every validation step automatically, and gives you a clear pass or fail result in real time.

Your AWS Kiro Agent Can Now Query CloudZero. Here's What To Ask It

CloudZero's new AWS Kiro integration puts cost intelligence directly in your agentic IDE. Ask plain-language questions about spend, attribution, and cost-per-serve without leaving your development workflow. We see a similar pattern playing out across engineering teams running agentic development tools: code gets shipped fast, something moves in the cost data, and understanding why still requires leaving your environment entirely.