
The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Back to the Basics: The Foundational Role of DDI in Any Network

In the ever-evolving landscape of networking, a plethora of three-letter acronyms makes up the alphabet soup that is part of every engineer’s vocabulary. Whether it’s TCP, UDP, SSH, or one of dozens of others, one acronym is commonly left out of the discussion: DDI (DNS, DHCP, and IP address management). These seemingly simple letters are often overlooked, but they are a crucial foundation for managing a stable, secure, and efficient network.

Reward engineers who fix problems before they cause outages

Are you recognizing the good work engineers do to prevent outages? "The people that are out there doing good work to prevent fires from ever occurring, we're not often recognizing them. We're not often rewarding them. And once things go wrong, someone comes in and fixes it. That's great. That's needed. But we're rewarding that behavior. And so it becomes a bit of people are motivated by what behavior you reward."

Migrating from SVN to Git: Step-by-Step Guide

Article updated June 2024. Is your current Subversion (SVN) version control system not meeting the needs of your development team? Perhaps you’ve heard of Git, but you’re so entrenched in SVN that converting to a new version control system seems like a daunting task. Fear not! No task is insurmountable when you have the power of the legendary GitKraken Desktop on your side.

The Definitive Guide to Kubernetes Cluster Upgrades

Kubernetes continues to play a pivotal role in orchestrating containerized applications with its cloud-native capabilities. Of course, capabilities like flexibility and scalability mean organizations must be extra vigilant, especially when it comes to maintaining the health and efficiency of Kubernetes clusters.

Running ML/LLM models on Kubernetes Across Major Cloud Providers with Abhishek Choudhary

Abhishek, co-founder and CTO of @truefoundry, explores the complexities of building a machine learning platform on Kubernetes. Discover solutions to challenges like handling diverse hardware, managing large Docker images, and optimizing costs. Learn how TrueFoundry uses tools like Argo CD, KEDA, and Istio to create efficient abstractions for data scientists and streamline ML operations.

Guide to Monitoring Webhook Performance Using Telegraf

Monitoring webhook performance is crucial to ensure reliable and efficient communication between your software/application and external services, as delays or failures in webhook processing can lead to significant data loss or service disruptions. Additionally, performance monitoring helps identify bottlenecks and optimize the system, ensuring a smooth and responsive user experience.

How to Interactive Rebase in GitKraken Desktop #shorts

Learn how to use GitKraken Desktop's interactive rebase editor! Whether you want to pick, reword, squash, or drop a commit, this handy feature makes it quick and intuitive. Rebase commits normally, edit commit messages, combine commits, or remove them entirely! Check out our latest GitKraken Desktop tutorial to see it in action.

Stay Ahead of Known Vulnerabilities with Automated Patch Management

The consequences of not patching are everywhere: remember the Log4j vulnerability that could give attackers complete access to your devices? The best way to prevent this is to use a patched version of Log4j, so why did it become such a catastrophic and widespread security event? Because people hate, forget, or simply dismiss patching as a labor-intensive part of managing their infrastructure.

How Much Does AWS Fargate Cost?

Amazon Web Services (AWS) offers Fargate, a serverless container service that eliminates the need to manage underlying infrastructure. Containers ensure applications run reliably across different computing environments without the overhead of server management. Fargate works with Amazon Elastic Container Service (ECS) and Amazon Elastic Kubernetes Service (EKS). It allows developers to focus on building their applications, offering enhanced security through isolation.

Better multi-timezone support for On-call overrides

Today, we are bringing enhancements to on-call overrides. For many remote teams using Spike, we are addressing the need to manage overrides across multiple time zones. This new design makes it easy to see override times in the local time of the person taking over. It adds clarity and helps you be mindful about on-call times. We also focus on clearly showing who is taking over on-call duties, enhancing overall management and coordination.
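
To make the idea of showing an override in the local time of the person taking over concrete, here is a minimal Python sketch. It is not Spike's implementation: the function name, the sample override window, and the fixed UTC+05:30 offset are all assumptions chosen for illustration.

```python
from datetime import datetime, timedelta, timezone

def override_in_local_time(start_utc, end_utc, local_tz):
    """Convert an override window stored in UTC to the assignee's time zone."""
    return start_utc.astimezone(local_tz), end_utc.astimezone(local_tz)

# Hypothetical override window: 18:00 to 23:00 UTC on 1 July 2024.
start = datetime(2024, 7, 1, 18, 0, tzinfo=timezone.utc)
end = datetime(2024, 7, 1, 23, 0, tzinfo=timezone.utc)

# Assumed assignee offset of UTC+05:30, given as a fixed offset so the
# example runs without a time zone database installed.
assignee_tz = timezone(timedelta(hours=5, minutes=30))

local_start, local_end = override_in_local_time(start, end, assignee_tz)
print(local_start.isoformat(), "to", local_end.isoformat())
```

A fixed offset ignores daylight saving time. For named zones that observe it, `zoneinfo.ZoneInfo` from the Python standard library can be passed as `local_tz` instead.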