Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

The rhythm of reliability: inside Canonical's operational cadence

In software engineering, we often talk about the “iron triangle” of constraints: time, resources, and features. You can rarely fix all three. At many companies, when scope creeps or resources get tight, the timeline is often the first element of the triangle to slip. At Canonical, we take a different approach. For us, time is the fixed constraint. This isn’t just about strict project management. It is a mechanism of trust.

Harnessing the potential of 5G with Kubernetes: a cloud-native telco transformation perspective

Telecommunications networks are undergoing a cloud-native revolution. 5G promises ultra-fast connectivity and real-time services, but achieving those benefits requires an infrastructure that is agile, low-latency, and highly reliable. Kubernetes has emerged as a cornerstone for telecom operators to meet 5G demands.

Store Docker images in Bitbucket with Bitbucket Packages | Bitbucket Blitz | Atlassian

In this video, I’ll show you how to store your Docker images directly in Bitbucket using Bitbucket Packages, so your code, CI/CD, and container images all live in one place. Bitbucket Packages is a native Docker registry for Bitbucket. By keeping your images alongside your repositories and pipelines, you can reduce tech stack complexity and enhance your security posture by managing permissions in a single system, rather than juggling yet another external registry, such as Docker Hub or Artifactory.

FinOps Insights for IT Leaders

FinOps insights for IT leaders often focus on cloud spend, but IT leaders know that real cost drivers extend across hybrid environments. Achieving clarity requires more than budget reports. It requires understanding how workloads behave over time, how performance and capacity shift, and where visibility gaps hide operational and financial risk. To support those efforts, we sat down with Tim Conley, creator of Galileo, to explore practical FinOps insights for IT leaders.

How to Track Down the Real Cause of Sudden Latency Spikes

Start with distributed tracing to find which service is slow, then use continuous profiling to see why the code is slow, and finally apply high-cardinality analysis to identify which users or conditions trigger the problem. It's 2 AM. Your phone buzzes. Users are reporting timeouts. The metrics dashboard shows p99 latency spiking from 200ms to 4 seconds, but everything looks normal—CPU at 60%, memory stable, no error spikes. A quick pod restart helps briefly, then latency climbs right back up.

Stop Treating Models Like Magic, Start Treating Them Like Binaries

In my previous posts, we discussed the where and the how of managing your ML assets. We showed you how JFrog Artifactory acts as a powerful, universal model registry (the “where”) and how the FrogML SDK serves as the gateway to get your models and metadata into it (the “how”). Now, let’s talk about the why.

FAQs, SchmAQs: The IT Automation Solution that Does the End-to-End Work for You

At some point in the last few decades, every enterprise convinced itself that the humble FAQ page was going to save IT. If you could just document everything (every how-to, every troubleshooting step, every tribal data nugget living in someone’s head) you could finally stop the ticket flood. The idea was for employees to self-service and avoid escalating to engineers while freely sharing knowledge across a de-siloed ecosystem. But of course, that’s not what actually happened.