
The latest News and Information on DevOps, CI/CD, Automation and related technologies.

An exciting update on diff and merge performance

Back in July I published "Encountering some turbulence on Bitbucket's journey to a new platform," sharing publicly for the first time that Bitbucket Cloud is in the final stages of a migration from our data center onto Atlassian's cloud platform, the same internal platform underlying Jira Cloud, Confluence Cloud, Statuspage, and many other internal services. I also shared that increased file system latency resulting from this platform move has made certain operations slower.

How to monitor your disks and filesystems, now also with eBPF

Current IT monitoring software lacks the metrics needed to minimize downtime for systems and applications. Most tools provide system and application metrics, but properly monitoring your infrastructure requires much more than that. eBPF is a technological advancement that lets monitoring software collect rich information from the Linux kernel and present it.

August/2021 - HAProxy 2.0+ HTTP/2 Vulnerabilities Fixed

If you are using HAProxy 2.0 or newer, it is important that you update to the latest version. A vulnerability was found that makes it possible to abuse the HTTP/2 parser, allowing an attacker to prepend hostnames to a request, append top-level domains to an existing domain, and inject invalid characters through the :method pseudo-header.

[Webinar] Take a Walk On the Safe Side: Using Feature Flags W/ Komodor & Sentry

Feature flags are enabling developers to deploy with confidence, safe in the knowledge that they can always disable buggy features in production without changing the code. While this increases the speed, stability, and frequency of software updates, it’s no magic trick and still requires planning in advance and using the right tools.
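As a rough illustration of the idea, here is a minimal sketch of a feature flag that can be switched off without touching the code. It assumes flags are read from environment variables named `FEATURE_<NAME>`; that naming scheme, the `flag_enabled` helper, and the `checkout` function are hypothetical and are not taken from Komodor, Sentry, or any particular flag service.

```python
import os


def flag_enabled(name, default=False, env=None):
    """Return True if the feature flag `name` is switched on.

    Flags are looked up as FEATURE_<NAME> in the environment. This naming
    scheme is an assumption made for illustration only.
    """
    env = os.environ if env is None else env
    raw = env.get(f"FEATURE_{name.upper()}")
    if raw is None:
        return default
    return raw.strip().lower() in {"1", "true", "on", "yes"}


def checkout(cart_total, env=None):
    # The new, riskier code path runs only while its flag is on.
    if flag_enabled("new_pricing", env=env):
        return round(cart_total * 0.9, 2)  # hypothetical new pricing rule
    return cart_total  # existing, known-good behaviour
```

Setting `FEATURE_NEW_PRICING=true` turns the new path on, and unsetting it (or setting it to `off`) falls back to the old behaviour with no redeploy of the code itself. Real flag services typically add per-user targeting, gradual rollouts, and audit trails on top of this basic switch.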

Systems Management Named Biggest Area for IT Professional Development

The 2020 SolarWinds® IT Pro Day survey looked at the changing roles and responsibilities of IT pros throughout the pandemic. Overall, nearly two-thirds of surveyed IT pros discovered a new sense of confidence, despite contending with challenges such as reduced budgets, greater decision-making responsibilities, and longer hours caused by their organizations’ response to the pandemic.

How CloudZero Manages Cloud Costs During Our Product Discovery Process

Like any SaaS organization, CloudZero cares about our COGS, margin, and unit cost. As you can imagine, we have a fair amount of insight into how those numbers are changing every day thanks to being “customer zero” on our cloud cost intelligence platform. Talking with other companies, I often find that the best organizations understand that measuring the cost implications of new products and features once you’ve deployed them can be too late.

Getting Started with Site Reliability Engineering

Site Reliability Engineer (SRE) is one of the fastest growing jobs in tech, with LinkedIn reporting 34% YoY growth in 2020 and over 9000 openings in its Emerging Jobs Report. If you’re new to SRE and exploring it as a career path, understand that it can be a challenging but rewarding experience. Here are some quick tips on how you can get started with SRE and jump-start your career.