Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Cloud monitoring, security and related technologies.

Achieving Full Visibility: Modern Monitoring for Distributed Cloud Applications

Today’s applications are hybrid, cloud-centric, service-oriented, API-dependent, and geographically distributed. The monitoring practices we relied on for decades are no longer sufficient. It is critical to monitor all the internet-centric dependencies, connectivity, and cloud application components – and to do so from the user’s perspective so IT operations teams can achieve digital resilience and deliver performance. This session will cover DEM, APM, and IPM and how they can work together to pinpoint issues before they occur, so users receive a great digital experience.

Pepperdata Resource Optimization for Data Workloads on Kubernetes

Struggling with underutilized Kubernetes resources or rising cloud costs? Learn how Pepperdata Capacity Optimizer delivers real-time, automated resource optimization for Kubernetes and Amazon EMR workloads—helping teams reduce costs and boost performance without manual tuning. In this video, discover how Pepperdata helps DevOps, platform engineers, and FinOps teams.

Self-hosted runners vs cloud CI/CD: A complete decision guide

Your CFO just asked about operational efficiencies across the engineering org. Tooling budgets are under the microscope, and suddenly CI/CD costs are getting attention. Sound familiar? When the pressure’s on to cut software spend, CI/CD often looks like a tempting target. It’s visible, measurable, and seemingly easy to move.

32 Best FinOps Tools For 2025: Features And Comparison

In recent years, cloud financial management has evolved beyond what many cloud stakeholders anticipated. The overwhelm has led too many companies to struggle to accurately monitor, allocate, and optimize their cloud costs. This issue cost companies about 30% of their cloud budgets in 2022 alone, according to Gartner. With FinOps, you can prevent this bleeding without sacrificing innovation. Yet, taking a manual approach to FinOps can be inefficient and error-prone.

Navigating Shopware logs and slow pages in a real world scenario

A Shopware store goes from smooth to sluggish—pages take 10 seconds to load, even longer in some cases. What happened? In this post, we tell the true story of how one overlooked plugin setting nearly collapsed a storefront, and how it was resolved using native tools. If you’re shipping code in Shopware without clear performance observability, this is your wake-up call. Everything was working, until it wasn’t.

Enterprise Drupal: Why hosting all your apps on one platform matters

For many enterprises, Drupal has been the backbone of their web operations for years. It’s a battle-tested CMS that handles complex content needs with elegance. But business needs have evolved. Today, it’s rare for a company to rely only on Drupal. They are spinning up Python APIs, .NET backend services, Node.js apps, Java microservices — expanding their digital ecosystems around Drupal’s core.

Infrastructure monitoring with Site24x7 | Cloud, Kubernetes, and Hybrid Environments

Modern IT environments are dynamic, distributed, and constantly evolving. You need more than traditional monitoring to keep everything running smoothly. Site24x7 is your all-in-one, AI-powered infrastructure monitoring solution. What this video covers: Whether you're overseeing AWS, Azure, GCP, OCI, VMware, or Kubernetes, Site24x7 simplifies it all with a single agent and AI-driven insights.

GPU Powerhouse: Scaling an AI Cloud in the Heart of Europe

The AI revolution needs more than models - it needs massive infrastructure. And Julien Gauthier is building it. In this episode of Uplink, Julien, CEO of Arkane Cloud, joins host Michael Reid to unpack how his company scaled from 3D rendering and gaming to delivering GPU cloud services for AI workloads across the globe. We explore how Arkane built a 1,000-GPU cluster in Paris (with capacity for 6,000), the rise of inference workloads in Europe, and the real-world engineering and business challenges of deploying high-density infrastructure - including cutting-edge liquid cooling handling 135kW per cabinet.

Trace Distributed Map states for AWS Step Functions with Datadog

AWS Step Functions offers the Distributed Map state, enabling you to coordinate massively parallel workloads within your serverless applications. With this feature, a single Step Functions execution can fan out into up to 10,000 parallel workflows simultaneously, making it possible to efficiently process millions of items in parallel. This capability unlocks new possibilities for large-scale data processing, such as image transformation, log ingestion, or batch analytics.

Master Your AWS Cloud Environment With Observability

In many cases, cloud and on-premises environments exist side by side. The only way to maintain visibility into such intricate hybrid ecosystems is with a sophisticated end-to-end observability solution. Here are the key factors in choosing a comprehensive observability solution to help you master your AWS cloud and on-prem environments.