
The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Scaling with Wildcard Certificates: Why Modern Infrastructure Benefits

Managing TLS certificates at scale is one of those operational tasks that starts simple and quickly grows into a sprawling problem. As organizations adopt microservices, multi-tenant architectures, and globally distributed load balancers, the number of domains and subdomains they support can expand dramatically. Each certificate then requires its own lifecycle management. Wildcard certificates offer a powerful solution to this growing complexity.
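To illustrate why one wildcard certificate can stand in for many per-subdomain certificates, here is a minimal sketch of the usual matching rule: a leading `*` covers exactly one leftmost label. The function name `wildcard_matches` and the example hostnames are hypothetical and not taken from the article, and real TLS libraries apply further checks that this sketch leaves out.

```python
def wildcard_matches(pattern: str, hostname: str) -> bool:
    """Return True if a certificate name such as '*.example.com' covers hostname.

    Simplified sketch: the wildcard is only honored as the whole leftmost
    label, and it stands for exactly one label.
    """
    pattern_labels = pattern.lower().rstrip(".").split(".")
    host_labels = hostname.lower().rstrip(".").split(".")

    # A wildcard never spans multiple labels, so the label counts must agree.
    if len(pattern_labels) != len(host_labels):
        return False

    if pattern_labels[0] == "*":
        # The leftmost label may be anything non-empty; the rest must match.
        return bool(host_labels[0]) and pattern_labels[1:] == host_labels[1:]

    # No wildcard: require an exact match.
    return pattern_labels == host_labels


if __name__ == "__main__":
    for host in ("api.example.com", "a.b.example.com", "example.com"):
        print(host, wildcard_matches("*.example.com", host))
```

Under this rule, a single `*.example.com` certificate covers every first-level subdomain, but not the bare domain and not deeper names such as `a.b.example.com`.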

Secure by Default: Why AI-Driven Delivery Needs a Rethink

AI speeds delivery but expands risk. Teams need context, verification, behavior detection, and learning to stay secure by default. Software delivery has been accelerating for more than a decade, and the arrival of AI has pushed us into an entirely new velocity class. Code generation, configuration scaffolding, infrastructure suggestions, remediation hints, and deployment decisions now involve AI. It participates in every stage of the delivery pipeline. On the surface, this feels like progress.

Harness AI November 2025 Updates: AWS Integration, Database DevOps, & Enterprise-Grade AI Across the SDLC

November was another big month for Harness AI, with new capabilities that deepen our work with AWS, bring AI-native automation to the database, and keep our model stack on the cutting edge across the SDLC.

Efficiency at any scale: How HAProxy maximizes the benefits of modern multi-core CPUs

Unlock peak load balancing performance with HAProxy! In this blog post, we'll explore how HAProxy intelligently harnesses the power of modern multi-core CPUs while navigating challenging architectural complexities like NUMA. Discover how HAProxy leverages optimized multithreading and provides automatic CPU binding to deliver both unparalleled efficiency and speed, ensuring your load balancing is faster than ever.

Cost Optimization Is Now Part of the SRE Playbook

In the era of cloud-native architectures, Site Reliability Engineering (SRE) has matured from a discipline focused purely on uptime to a sophisticated practice of efficient reliability. The key driver for this evolution is an undeniable truth: cloud spend has become intrinsically linked to system stability.

The Agentic Solution Making AI's Value Clear to IT, Execs, and Customers

Leaders in every industry are investing heavily in AI. Shocking, I know. Operations teams are modernizing infrastructure and automating workflows while boards are asking for faster returns. And yet, for all the investment, one question still lingers: where’s the value? The truth is that most enterprises have a translation problem, not necessarily ‘just’ a visibility problem. Executives see AI as a growth strategy, but IT sees it as operational complexity.

Automate infrastructure operations with Datadog Infrastructure Management

Many organizations struggle to track how their cloud infrastructure changes over time. Modern environments span tens of thousands of resources across hundreds of accounts and multiple clouds. Application teams add new services and regions at a rapid pace, increasing the number and variety of resources that need to be managed. These shifts can cause infrastructure configurations to drift from a well-architected state, increasing the risk of service reliability issues and unexpected cloud spend.
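As a rough illustration of what configuration drift means in practice, the sketch below compares a desired resource configuration against the observed one and reports every setting that differs. It is not Datadog's implementation; the function name `find_drift` and the sample settings are hypothetical.

```python
def find_drift(desired: dict, actual: dict) -> dict:
    """Return {setting: (desired_value, actual_value)} for every mismatch.

    Assumption for this sketch: a setting missing on either side is
    reported with None in that position.
    """
    drift = {}
    for key in sorted(desired.keys() | actual.keys()):
        want = desired.get(key)
        have = actual.get(key)
        if want != have:
            drift[key] = (want, have)
    return drift


if __name__ == "__main__":
    # Hypothetical settings for a single resource.
    desired = {"instance_type": "m5.large", "encrypted": True, "region": "us-east-1"}
    actual = {"instance_type": "m5.xlarge", "encrypted": True, "public_ip": True}

    for setting, (want, have) in find_drift(desired, actual).items():
        print(f"{setting}: desired={want!r} actual={have!r}")
```

Running a comparison like this on a schedule across accounts is one simple way to surface drift before it turns into a reliability or cost problem.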