Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

DevEx Unpacked 002 - DevRel, Donuts & Distributed Systems with Dan McKinney

Episode 002: In this episode of DevEx Unpacked, Alan Carson sits down with Dan McKinney, one of Cloudsmith’s earliest team members and now Head of Solutions Engineering. Dan reflects on his unique journey from writing docs and filming DevRel videos to leading high-stakes enterprise sales. Discover how Cloudsmith scaled from a two-person start-up to a platform trusted by global enterprises, why software supply chain security is more urgent than ever, and what features make developers and security teams lean in.

Rancher Live: The Kubernetes report card

Join Divya Mohan live on July 17th at 2 PM UTC on to explore OpenReports—a new project for unified, API-driven reporting. Discover how OpenReports simplifies capturing and consuming policy, security, and compliance reports via a vendor-neutral API. See live demos, real-world use cases, and learn how this project brings clarity and consistency to Kubernetes reporting. Don’t miss it!

Why your Shopware store feels fast until it doesn't

Shopware is a powerful platform, but its performance depends entirely on how it is used. In this article, we explore the most common and avoidable causes of slowdowns in production environments, including plugin overload, cache fragmentation, and misconfigured admin settings. Whether you are preparing for a high-traffic event or simply aiming to keep your storefront fast and responsive, this guide will help you identify the root causes of performance regressions.

No Sandwich, No Security: What This Week's Lunch Taught Me About DNS Blind Spots

Like many shoppers in the UK this week, I found myself staring at half-empty shelves in my local grocery store. In a small but frustrating twist, my usual sandwich, chicken mayo on malted bread, was nowhere to be found. The disruption wasn’t just about lunchtime preferences; it was part of a broader impact from cyberattacks that hit major UK retailers, including Co-op and Marks & Spencer.

An Easy Guide to Getting Started with Elastic APM

Code in production will break. Maybe a request takes too long, maybe it fails quietly, or maybe it works fine one minute and falls over the next. Logs can help, sure—but they don’t always show the full picture, especially when performance issues are involved. Elastic APM gives you a clearer view. It traces what your application is doing from incoming requests to database queries and everything in between.

A Simple Guide To GKE Cost Allocation And Cluster Spend

Running workloads on Google Kubernetes Engine (GKE) delivers impressive scalability and flexibility. Yet, it can also introduce a tricky challenge: tracking GKE costs accurately. Remember, GKE costs rarely scale linearly. Overprovisioned nodes, idle autoscalers, and orphaned workloads can quietly balloon your bill in the background. And while GKE’s native tools offer some visibility, they often miss the full picture.

Apache Spark security: start with a solid foundation

Everyone agrees security matters – yet when it comes to big data analytics with Apache Spark, it’s not just another checkbox. Spark’s open source Java architecture introduces special security concerns that, if neglected, can quietly reveal sensitive information and interrupt vital functions.

The Future of IT Is Human + Agentic: How Zero Ticket IT Is Reshaping Tech Careers

Automation has always stirred up fears of job loss. For IT professionals, the conversation has only grown louder with the rise of AI. But the truth is that the future of IT is not about replacement—it’s about reinvention. For decades, IT has been defined by its firefighting: manually resolving tickets, managing endless alerts, and fielding repetitive service requests. These tasks are ripe for automation, but automation doesn’t eliminate the need for IT talent.

How to Monitor Kafka Producer Metrics

Your Kafka producer pushed a million messages yesterday. Nice. But can you tell if they all made it? Or why did latency spike at 2 PM? Producer metrics help you determine that. They expose how long messages take to send, whether messages are getting stuck, and whether retries are piling up. Let’s go over which ones help while debugging and how to monitor them.