The latest news and information on cloud monitoring, security, and related technologies.

Introducing The Enhanced CloudZero Academy: Learn, Grow, And Level Up Your FinOps Skills

If there’s one thing we’ve learned at CloudZero, it’s that success in FinOps isn’t just about having the right tools. It’s about knowing how to use them, and understanding the “why” behind every number, dimension, and dashboard.

ML inference in PHP by example: leverage ONNX and Transformers on Symfony

This blog is based on a presentation by Guillaume Moigneu at the Symfony 2024 conference. Machine learning and AI are no longer limited to Python and Node.js. PHP developers can now run AI models directly in their applications using modern tools and libraries. This guide shows you how to implement machine learning inference in PHP using ONNX and Transformers.

Modern Service Architecture for High-Velocity Operations

Modern service architecture supports organizations that target sustained velocity, predictable delivery cycles, and scalable global operations. Cloud-native platforms, microservices patterns, and distributed execution models now anchor these environments. The approach emphasizes modularity and flexibility, in contrast to traditional monolithic designs. The 2025 Gartner Magic Quadrant for Cloud-Native Application Platforms identifies AWS, Red Hat OpenShift, and Heroku as leaders because they strengthen developer experience, platform engineering, and security.

Cloud Security Best Practices Every Company Should Follow

As more businesses move their data, applications, and daily operations to the cloud, securing that environment has become a top priority. Cloud platforms offer flexibility, scalability, and cost savings, but they also introduce shared responsibility, meaning both the provider and the business must play a role in keeping systems safe. Understanding essential cloud security best practices helps organizations reduce risk, protect sensitive information, and maintain compliance in an increasingly digital world.

Elasticsearch: The context engine for grounding and orchestration in Microsoft Azure AI Foundry Agent Service

The rise of large language models (LLMs) and agentic applications promises to transform enterprise workflows. Yet, the core challenge remains: How do we ensure these powerful agents generate accurate, relevant, and trustworthy responses based on proprietary enterprise data rather than relying solely on their generic training knowledge? The answer lies in grounding — connecting the LLM to verified, trusted, and up-to-date information.

Azure Monitor offers Grafana dashboards natively for immediate, real-time operational monitoring

Editor’s note: This blog was originally published in May 2025, when Azure Monitor dashboards with Grafana became available in public preview. It was updated in November 2025 to reflect general availability. The Grafanaverse just got a little bit bigger.

AWS And Azure Outages Will Recur - Here's How You Ensure Resilience

The cloud has long promised limitless scalability and near-perfect uptime. But if you tried to access your Microsoft 365 dashboard or recline your smart bed last week, and got nothing but a spinning icon, you weren’t alone. In the span of 10 days, both Amazon Web Services (AWS) and Microsoft’s Azure Cloud suffered widespread outages that rippled across industries.

KubeCon Atlanta Signals Key Shift: From Cloud Cost To Value Engineering

After three days of demos, sessions, and hallway conversations at KubeCon Atlanta, one thing became clear to CloudZero CTO Erik Peterson: the cloud-native world is shifting from cost control to value engineering. Teams aren’t just fighting bills anymore. They’re fighting complexity, GPU scarcity, Kubernetes sprawl, and pressure from the business to justify every dollar of technical investment. And this year’s KubeCon attendees? They were ready for those conversations.

AI API Aggregation: Managing Costs And Complexity Across Multiple LLMs

Running multiple LLMs without aggregation can feel like managing five different clouds with no dashboard. Sure, you can make it work, but you won’t like the bill. And most SaaS teams didn’t start with a multi-LLM strategy. It just happened. You added one model for reasoning, another for summarization, or maybe a fine-tuned version for customer support. Fast-forward six months, and your AI stack looks like a tangle of APIs, each charging for tokens on its own terms.