Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

The Impending SaaS Crisis: How AI Is Disrupting SaaS - And How You Can Prepare

At CloudZero’s most recent company retreat, we held an investor panel, where representatives from four of the VC firms investing in CloudZero fielded questions from our team. Unsurprisingly, a good deal of the conversation revolved around AI. A standout moment from this panel came when one investor described a vibe coding session he’d done about a month prior. “Vibe coding,” for the uninitiated, means using AI to build an application without writing any actual code yourself.

What craft means for Canonical

Last month Jon Seager (our Vice President for Ubuntu Engineering) wrote about crafting software: Multiple Canonical products have craft in their names: Snapcraft, Charmcraft, Rockcraft (and there are others in the works). Our craft products are tools for making software, for the software craftsperson. To be a maker of tools comes with responsibilities – when you decide what tools should be like, you are also deciding how people should work.

How to Automate Password Resets with Resolve #itautomation #ai #aiautomation #agenticai

Tired of password reset tickets? Meet RITA — Resolve’s AI IT agent. Understands natural language Verifies identity Resets passwords instantly No tickets. No delays. This is Zero Ticket IT in action. Watch Resolve’s Derek Pascarella show how it works.

Weaponized AI vs. AI Driven Security Posture Management: Why the Battle Starts in Misconfigurations

August 5, 2025, Las Vegas Black Hat 2025, Abnormal AI officially launched its Security Posture Management for Microsoft 365. This release marks a critical turning point. In an era where attackers weaponized AI to uncover and exploit misconfigured cloud environments at machine speed, reactive security simply can’t keep pace. Threat actors are now leveraging automated AI to scan systems, identify configuration drift, escalate privileges, and deploy zero‑day exploits in seconds.

Automating Network Diagrams for A Complete View of All Active and Passive Components

Accurately tracking how data center devices are connected—across switches, patch panels, structured cabling, and more—is essential for efficient data center operations. But for many teams, documentation still lives in static diagrams or outdated spreadsheets, requiring extensive manual effort. This is time-consuming and leads to inaccuracies that can cause delays in planning or troubleshooting and unnecessary risk. Sunbird DCIM changes that.

IP Optical Middle Mile Network Architectures for Rural America

In addressing the burgeoning demand for broadband connectivity in rural America, a robust and innovative IP Optical Network Architecture is essential. The architecture must incorporate a best-in-class multi-layer design optimized for middle-mile functionality, integrating both voice and security dimensions. A pivotal requirement is to decouple the last mile from the middle mile, ensuring that the last-mile solutions can remain agnostic to various technologies while still benefiting from a unified middle-mile infrastructure.

Avoid the Chaos Engineering bottleneck

Chaos Engineering is great, but by itself it can create bottlenecks that limit your reliability journey. FULL TRANSCRIPT: One of the things we've learned while building Gremlin and being the first Chaos Engineering tool to market is with all the greatness that comes with this approach, we've learned some of the downfalls, some of the drawbacks. And one of those is how you scale this practice.

How server-side tagging benefits complex operational systems

What was the biggest pain in your childhood when doing puzzles? Of course, the worst is when you need to somehow finish a plain-black segment of 300-400 pieces. Lost puzzle pieces proudly occupy the second place. Well, if you have chosen to work in the digital marketing or e-commerce industry, nothing changes. You still will suffer from huge black spots and tiny pieces of missing data. The bigger the company you work for is and the larger the amount of data you operate with, the more you will be affected by those information gaps.

Optimizing Legacy ML Systems with Real-World DevOps Practices

We chose to feature this article because it reflects exactly what OpsMatters stands for: practitioners solving real problems with practical DevOps thinking. When we came across Ashish's detailed breakdown of his experience modernizing a complex ML environment, it stood out for its clarity and actionable insights. We reached out to him to learn more about the work behind this case study, and with his permission, we are sharing it here so the broader community can benefit from these lessons in observability, cost optimization, and real-world DevOps execution.

Introduction to End-to-End Testing: Everything You Need to Know in 2025

End-to-end (E2E) testing is a crucial software testing methodology that ensures an application works flawlessly from start to finish. In today’s fast-paced development cycles (think Agile and DevOps), E2E testing helps teams validate entire user workflows – from the user interface on the front end, through any APIs or services, down to databases or external integrations – exactly as a real user would experience them.