The latest News and Information on DevOps, CI/CD, Automation and related technologies.

How Engineers Get Leadership Buy-In for Technical Initiatives

Getting leadership to greenlight your technical work isn't about having the right answer; it's about speaking the right language. CircleCI CTO Rob Zuber shares the frameworks he's developed over 12 years for translating engineering priorities into business impact, navigating organizational dynamics, and building the relationships that make buy-in happen before you ever enter the room.

Agent Skills move too fast for git

Last month I was making a change to sx, our CLI. I updated a core flow, adding external catalogs as a source for sx add. Small change. Then came the testing. I knew I was messing with a core flow and wanted to be sure I hadn't broken anything. I spent about forty-five minutes setting up an isolated environment. Spinning up Docker. Fighting with tmux. Getting a clean install state I could run through the TUI a few times. Forty-five minutes of my afternoon that produced zero code. I complained in Slack.

Autonomous AI for Cloud-Native Cost Optimization: Balancing FinOps and Performance SLAs

Platform Engineering leaders are caught between two competing imperatives. You’re under pressure to flatten cloud spend, but your team is still provisioning defensively because nobody wants to be the person who causes a production incident. You try to optimize, but six months later, when someone pulls a report, nothing has changed.

How to set up rolling deployments with CircleCI

A rolling deployment updates running application instances in batches, replacing old instances with new ones while the application keeps serving traffic. The concept applies to any system that can run multiple instances of an application, but Kubernetes has it built in as the default deployment strategy. With maxUnavailable set to 0, Kubernetes terminates an old pod only after its replacement passes the configured readiness check, and because Services route traffic only to ready pods, no requests land on an unready instance.
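
The replace-one-at-a-time behaviour described above can be sketched as a small simulation. This is a toy model, not Kubernetes code: the `Pod` class, the `rolling_update` function, and the `is_ready` callback are hypothetical names invented for illustration, and the model assumes a surge of one extra pod with no pod allowed to be unavailable.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Pod:
    version: str
    ready: bool = False


def rolling_update(pods: List[Pod], new_version: str,
                   is_ready: Callable[[Pod], bool]) -> int:
    """Replace pods one at a time and return the lowest ready count seen.

    Hypothetical model: one surge pod at a time, zero pods unavailable.
    """
    min_ready = sum(p.ready for p in pods)
    while any(p.version != new_version for p in pods):
        candidate = Pod(new_version)            # start one extra (surge) pod
        pods.append(candidate)
        candidate.ready = is_ready(candidate)   # stand-in for the readiness check
        if not candidate.ready:
            # Simplification: give up on the rollout. A real controller would
            # keep waiting; either way the old pods keep serving traffic.
            pods.remove(candidate)
            break
        old = next(p for p in pods if p.version != new_version)
        pods.remove(old)                        # terminate old only after new is ready
        min_ready = min(min_ready, sum(p.ready for p in pods))
    return min_ready


fleet = [Pod("v1", ready=True) for _ in range(3)]
print(rolling_update(fleet, "v2", lambda pod: True))  # prints 3
```

In this model the ready count never drops below the starting replica count, which is the property that keeps the application serving traffic throughout the rollout.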

10 best practices for optimizing Kubernetes on AWS

Optimizing Kubernetes on AWS is less about raw compute and more about surviving Day-2 operations. A standard failure mode occurs when teams scale the control plane while ignoring Amazon VPC IP exhaustion. When the cluster autoscaler triggers, nodes provision but pods fail to schedule due to IP depletion. Effective scaling requires network foresight before compute allocation.
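
The IP-exhaustion failure mode can be estimated ahead of time with some simple arithmetic. The sketch below is a rough upper bound, not a sizing tool: the function names are hypothetical, and it assumes each pod and each node consumes exactly one VPC IP address (no prefix delegation, no warm IP pools, no custom networking). The only AWS-specific fact it relies on is that five addresses are reserved in every subnet.

```python
import ipaddress
from typing import Iterable

# AWS reserves the first four addresses and the last address of every subnet.
AWS_RESERVED_PER_SUBNET = 5


def usable_ips(cidr: str) -> int:
    """Number of addresses in a subnet that can be assigned to workloads."""
    network = ipaddress.ip_network(cidr)
    return max(network.num_addresses - AWS_RESERVED_PER_SUBNET, 0)


def max_schedulable_pods(subnet_cidrs: Iterable[str], node_count: int,
                         ips_in_use: int = 0) -> int:
    """Upper bound on pods, assuming one IP per node and one IP per pod.

    ips_in_use covers anything else in the subnets (load balancers,
    other instances); it is a hypothetical parameter for this sketch.
    """
    total = sum(usable_ips(cidr) for cidr in subnet_cidrs)
    return max(total - node_count - ips_in_use, 0)


print(usable_ips("10.0.0.0/24"))                                      # prints 251
print(max_schedulable_pods(["10.0.0.0/24", "10.0.1.0/24"], 10))       # prints 492
```

If the pod count the autoscaler could reach exceeds this bound, new nodes would come up but their pods would have no addresses left to claim, which is the scheduling failure described above.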

Choosing GPU cloud platforms for developers

For developers building AI applications, training models, or running inference pipelines, the GPU cloud market in 2026 has never offered more choice, or more complexity. Picking the wrong platform means overpaying, dealing with availability problems, or battling infrastructure that slows you down rather than accelerating your work.

How to define your monitoring requirements (before you talk to a vendor)

This is a guest post from Laura Copeland, sharing key insights from a fireside chat with Chris Yates (part 1). Choosing the right database monitoring vendor isn’t just a technical decision; it’s a strategic one that affects your teams, your estate, your growth plans, and the culture of your organisation. It’s also a personal one if you’re a DBA. Something as critical as your monitoring system will shape your day‑to‑day work and, in many cases, how well you sleep at night.