Operations | Monitoring | ITSM | DevOps | Cloud

DevOps

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Reward engineers who fix problems before they cause outages

Are you recognizing the good work engineers do to prevent outages? "The people that are out there doing good work to prevent fires from ever occurring, we're not often recognizing them. We're not often rewarding them. And once things go wrong, someone comes in and fixes it. That's great. That's needed. But we're rewarding that behavior. And so it becomes a bit of people are motivated by what behavior you reward.

Back to the Basics: The Foundational Role of DDI in Any Network

In the ever-evolving landscape of networking, there are a plethora of three-letter acronyms that make up the wonderful alphabet soup that is a part of every engineer’s vocabulary. Whether it’s TCP, UDP, SSH, or one of the many other dozens, one acronym is commonly left out of the discussion: DDI. These seemingly simple letters are often overlooked or rarely thought of, but they are a crucial foundation for managing a stable, secure, and efficient network.

Reliability-Driven Fleet Management with Komodor

Maintaining a few K8s clusters is hard enough. Maintaining 1000+ clusters is virtually impossible without embracing new tooling and paradigm shifts. Join us for an insightful LIVE workshop exploring the possibilities of Kubernetes Fleet Management with Komodor, lead by Itiel Shwartz* In this session, we will dive into the challenges of multi-cluster management and how Komodor's comprehensive platform simplifies operations. Discover how to gain real-time visibility into your clusters, automate routine tasks, and troubleshoot issues across your entire fleet efficiently.

Monitoring as Code and Checkly Listed in the Gartner Hype Cycle for the Second Consecutive Year

I'm excited to share that Gartner has included Monitoring as Code (MaC) as an emerging practice to their Hype Cycles for SREs again, the second year in a row. Since we founded Checkly, our vision has been that monitoring should sit in your repository, be codified, and scale with your software development. There is no alternative to MaC as it allows your engineering team(s) to work together, create and maintain checks, and ultimately own their monitoring.

Managed Apps on Public Cloud: Why Operations Matter, Part I

You might be tempted to think that running an app on a public cloud means you don’t need to maintain it. While that would be wonderful, it would require help from the public cloud providers and app developers themselves, and possibly a range of mythological creatures with magic powers. This is because any app, regardless of the infrastructure on which it runs or its output, requires maintenance in order to yield accurate and reliable outputs.

Merging to Main: Productizing the Internal Developer Platform

During this session we went live with Steve Fenton (Octopus Deploy), Bartek Antoniak (VirtusLab), Jordan Chernev (Zillow), and Scott Hiland (Hunter Strategy). The discussion was around how to "productize" a developer platform inside an org - a look at what best practices look like and experiences and advice around that.

Stay Ahead of Known Vulnerabilities with Automated Patch Management

The consequences of not patching are everywhere: remember the Log4j vulnerability that grants hackers complete access to your devices? The best way to prevent this from happening is to use a patched version of Log4j — so why did this become a catastrophic and prolific security vulnerability event? A: Because people hate, forget, or simply dismiss patching as a labor-intensive part of managing their infrastructure.