Latest News

The Cool Evolution: Liquid Cooling in Data Centers

The Environmental and Efficiency Benefits of Liquid Cooling: Data centers are infamous for their voracious appetite for energy. As the digital universe expands, so does the environmental impact of maintaining these centers. Enter liquid cooling, a technology with the potential to slash energy consumption and reduce the carbon footprint of data centers. Liquid cooling offers superior thermal conductivity compared to air.

Building trust through incident communication with Adrián Moreno, VP of Engineering at SumUp

Today, good incident communication isn't a nice-to-have; it's an absolute must. But where do you even start? To help answer that question, we sat down with the VP of Engineering at SumUp, Adrián Moreno Peña, to get his perspective on how organizations of all sizes can share stellar comms no matter the situation. We discuss.

Critical Platform Engineering Metrics: KPIs that Matter for Success

Platform engineering metrics and/or platform engineering KPIs (Key Performance Indicators) can help us measure the success of this evolving approach and its impact on DevOps. According to our 2024 State of DevOps Report: The Evolution of Platform Engineering, 43% of organizations report that they have had a platform team for 3-5 years already. With this maturity, it’s important to measure what’s working and what isn’t working using the same indicators of success across the board.

Using eBPF to Debug eBPF

In one of our latest posts, StackState Co-Founder Mark Bakker described how eBPF revolutionizes observability and how StackState’s agents rely heavily on eBPF to capture and analyze the data moving through your cluster. Today, we’re looking at an example where our eBPF code failed and — by diving deep into the intricacies of eBPF implementation in the Linux kernel — share the tale of how we fixed it using even more eBPF.

CI/CD observability: Extracting DORA metrics from a CD pipeline

Last November, Dimitris and Giordano Ricci wrote a blog post about CI/CD observability that looked into ways to extract traces and metrics in order to get a better understanding of possible issues inside a CI/CD system. That post focused on getting data from a continuous integration (CI) system, and it really resonated with the community.

Cloud-based DCIM Software Powers Modern Data Center Operations

Traditionally, data centers have been managed using on-premises software – and for many companies, this solution has been sufficient. However, as the data center environment becomes more intricate and dynamic, a new approach to management is required. Cloud-based Data Center Infrastructure Management (DCIM) software has emerged as the next generation of management tools, offering unmatched flexibility, scalability, and cost-effectiveness.

What Is Intelligent IT Automation (and How Do I Get Started)?

So, you’ve been tasked with automating one or more of your tedious, time-consuming IT processes… but what exactly does that mean? And perhaps more importantly, where on earth do you start? IT process automation (ITPA) can cover a broad spectrum of potential use cases, ranging from the Service Desk to the NOC to Infrastructure and well beyond.

12 best practices for DevOps and IT teams to handle monitoring alerts

"Music is noise that makes sense," said author Yann Martel, implying that if a sound doesn't make sense, then it is perceived as just noise. Noise can thus be defined as any alert that affects our senses and disturbs our peace without adding any value. The digital age drowns us in stimuli of all kinds all the time, making the struggle to ignore noise in order to filter for sense harder than ever.