Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

ShipTalk Season 4 Finale: Engineering Excellence at AWS re:Invent

Welcome to the Season 4 finale of the Ship Talk podcast! Join special host Thomas Dockstader and several industry leaders at AWS re:Invent to discuss the intersection of AI and software delivery. The following is a series of interviews with partners, customers, and engineering leaders on the front lines of AI transformation. Don't miss the "Ship It or Skip It" segment, where our guests give their rapid-fire takes on everything from AI code reviews to the four-day work week.

Now in Harness DB DevOps: Percona Toolkit for safer MySQL schema changes | Harness Blog

If you've ever run an ALTER TABLE on a busy MySQL table in production, you know the feeling. The change is small. The risk isn't. Long-running table locks, queued writes, application timeouts, replication lag, a five-minute migration that turns into a half-hour incident review. We're shipping an integration that takes that anxiety out of the loop. Harness Database DevOps now supports Percona Toolkit for MySQL as part of Liquibase-based schema management.

Ask Cortex anything, right from Slack

The Monday morning thread. Someone asks who owns checkout-service. Someone else asks what changed in the Production Readiness Scorecard last week. A third person wants to know if the Kubernetes migration is blocking the launch next Thursday. The answers exist. They live in Cortex. But getting them into the thread means someone stops what they're doing, opens a tab, finds the data, and pastes it back. By the time they do, the conversation has moved on.

Misconfigured Alert Detection: Find the Alerts That Need Tuning

Netdata ships with hundreds of stock alerts. They cover a wide range of infrastructure conditions and they’re designed with sensible defaults. But “sensible defaults” and “correct for your environment” are not the same thing. A CPU threshold that’s perfectly reasonable for a build server might generate constant noise on a machine running batch jobs.

Who's on call? How Claude helped us calculate this 2,500x faster

Schedules are a core part of any on-call system. In ours, they define who to page and when. But people use them in lots of other ways too: checking their next shift, asking for cover while at the gym, keeping a Slack user group up to date, or updating a Linear triage responsibility. For many of our customers, they’re one of the main ways they interact with our product, and as they’re such a foundational part of On-call, it’s very important they work well.

Stop watching the looms: why the AI era belongs to infrastructure

I live in Manchester, England now. I moved here from Texas last summer (which is its own story), but the thing I wasn't prepared for is how the Industrial Revolution isn't history here. It's the city itself. And if you're American like me, you might need to hear this: the Industrial Revolution didn't start in the US. It started here. Manchester is where the modern world was born. You see it everywhere. The old cotton mills converted into apartments.

Poland's KSC Act Is Now in Force: Why NIS2 Compliance Starts with Infrastructure Automation

Poland’s implementation of the EU’s NIS2 Directive marks a decisive shift in how organisations think about cybersecurity, resilience, and operational risk. With amendments to the Act on the National Cybersecurity System (KSC Act) entering into force on 3 April 2026, enforcement expectations are now real, national, and significantly stricter than many organisations anticipated – including obligations for security controls, incident response, and supply‑chain governance.

Disaster Recovery Testing in Harness | Resilience Testing

In this video, we introduce Harness Resilience Testing and show you how to move beyond once-a-year DR drills to a continuously validated, pipeline-driven process. You'll see how Harness lets you validate regional failovers, check database replication lag under pressure, and confirm your hot standbys genuinely take over live traffic, all in one place. We also walk through a live DR test execution, showing exactly how Harness triggers the full failover sequence, runs every validation step automatically, and gives you a clear pass or fail result in real time.