Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

OTel Weaver: Consistent Observability with Semantic Conventions

Deploying a new service shouldn’t break dashboards. But it happens, usually because metric names or labels aren’t consistent across teams. You end up with traces that don’t link, metrics that don’t align, and queries that take hours to debug, not because the system is complex, but because the telemetry is fragmented. OTel Weaver addresses this by enforcing OpenTelemetry semantic conventions at the source.

Are Egress Fees Holding Your AI Business Back?

For AI companies, the landscape of cloud computing has always been a balancing act between innovation, costs, and compliance. That’s where Civo comes in. Offering a full cloud offering with GPUs, but without the usual headaches, Civo provides a rare combination: true data sovereignty and zero data egress charges. Let’s break down why these two features should be non-negotiables for your AI infrastructure.

SD-WAN, SASE, SSE, and the Coffee Shop Network: From Distraction to AI Superpower

Back in 2018, I wondered (perhaps loudly if SD-WAN was just IT’s hype-of-the-year, destined for the same eye-rolls as signature-based antivirus and GDPR compliance drives. Even then, I knew we couldn’t let messaging fatigue blind us to real technology shifts. Fast-forward to 2025: SD-WAN (Software-Defined Wide Area Network) not only stuck around, but became the springboard to something far bigger – SASE (Secure Access Service Edge).

9 Best IT Alerting Software in 2025 (Plus 3 Open-Source Options)

I’ve curated a list of 9 best IT alerting software and 3 open-source alternatives for you. Every tool on this list handles the core alerting functions you need: incident detection, fast alert delivery, clear escalation paths, and reliable incident logging. Since all these tools tick those boxes, I focused on what makes each tool special. You’ll find their unique features under “Standout Alerting Features of ” for each option.

JFrog Deployed on AWS: The Foundation for Cloud-Native Excellence

We are delighted to share the exciting news that JFrog has earned the “Deployed on AWS” badge in AWS Marketplace, marking yet another milestone in our journey of innovation and collaboration with Amazon Web Services (AWS). This achievement underscores our commitment to providing cutting-edge solutions that leverage AWS’s robust infrastructure to enhance the user experience and drive efficiency.

Git Rebase -i: Clean Your Commit History

Messy commit history? Git lets you clean it up before anyone sees it — and it’s easier than you think. In the final episode of Wait… Git Can Do That? — Volume 1, we walk through git rebase -i: Clean up the last N commits Squash, rename, or drop them Bonus: GitKraken Desktop lets you do it all visually You've just unlocked 8 Git powers most devs don’t even use.

Azure Reserved Instances: Saving Smart, Maximizing ROI

Many teams buy RIs with the best of intentions (predictability and up to 72% savings) only to realize later that they’ve either overcommitted or left money on the table. Without clear visibility, what starts as a smart cost-saving move can slither into silent waste. This guide will help you get ahead of that. We’ll walk you through the ins and outs of Azure Reserved Instances, compare them to other savings options, and share best practices to help you avoid common pitfalls.

How sum_over_time Works in Prometheus

The sum_over_time() function in Prometheus gives you a way to aggregate counter resets, gauge fluctuations, and histogram samples across specific time windows. Instead of seeing point-in-time values, you get the cumulative total of all data points within your chosen range—useful for calculating totals from rate data, tracking accumulated errors, or understanding resource consumption patterns over custom intervals.