
The latest News and Information on Containers, Kubernetes, Docker and related technologies.

What is Docker Swarm and How Does it Work?

For most organizations, having a stable and reliable IT infrastructure is essential for success. But managing multiple servers, databases, and applications can often be difficult and time-consuming. Container orchestration is a standard solution for handling this complexity. Docker Swarm has gained popularity as a container orchestration solution because of its simplicity and scalability.

Troubleshooting ECS Container Crashes

Amazon Elastic Container Service (ECS) is a versatile platform that enables developers to build scalable and resilient applications using containers. However, containerized services, like Node.js applications, may face challenges like memory leaks, which can result in container crashes. In this blog post, we’ll delve into the process of identifying and addressing memory leaks in Node.js containers running on ECS. First, let’s look closer at what a memory leak is.

Kubernetes Community Days Munich Recap

A couple of weeks ago I had the absolute joy of attending KCD Munich for the first time, with my friend and colleague Guy Menahem (whom some of you know simply as The Good Guy on Twitter and YouTube). Besides rooting for Guy and his co-speaker, Arsh Sharma of Okteto, during their session on Backstage.io and IDPs, I enjoyed being untethered from ‘booth duty’ and free to engage with all the beautiful human beings that gathered together for this Kubetastic event!

Demo Roundup: PagerDuty Operations Cloud for Kubernetes

In this demo, Corbin Mills shows how to use the PagerDuty Operations Cloud to streamline and automate the resolution of a node failure. You’ll see how he uses event orchestration (in PagerDuty AIOps) to enrich an alert with pod names and to automatically run a job that checks the Kube API status, so that a responder has instant context. AIOps also groups and suppresses alerts. Then you’ll see how the responder can run more health status checks without the need to SSH into the environment or interrupt a co-worker for access.

cert-manager can do SPIFFE? - Civo Navigate NA 2023

Ashley Davis, Senior Software Engineer and Maintainer of cert-manager, discusses the capabilities of cert-manager, an easy way to manage certificates in Kubernetes clusters. Ashley highlights the importance of trust-manager for managing trust bundles, which let clients verify certificate legitimacy. Additionally, he explores the potential of using X.509 certificates as a universal identity control plane in distributed systems through the concept of "SPIFFE" (Secure Production Identity Framework For Everyone).

Kubernetes Incident Management Best Practices

Creating just any infrastructure on Kubernetes is not enough. Plenty of basic configurations will get your application's infrastructure up and running, and they may work just fine for the time being. But incident response won’t always remain 100% reliable. You will run into new potholes, and that’s okay.

How to save on container costs efficiently using Kubernetes cost reporting in CloudSpend

In the current cloud-focused era, it is essential for businesses to streamline costs. As containerization and Kubernetes become increasingly popular, efficiently managing costs related to Amazon Elastic Kubernetes Service (EKS) and Azure Kubernetes Service (AKS) is crucial for maintaining a successful infrastructure.

Securing Access to Cloud Native Resources with Certificates - Civo Navigate NA 2023

In this talk, Alan Vailliencourt, a Senior Solutions Engineer with Teleport, discusses the importance of moving away from passwords and securing access to cloud-native resources using short-lived certificates. He highlights the risks associated with passwords and showcases the benefits of identity-native access, incorporating proof of presence, mutual authentication, and device security. The talk provides practical steps for adopting certificate-based authentication and improving security posture for Kubernetes, databases, and other cloud resources.

Kubernetes Troubleshooting Reimagined: Operators and Auto-Tracing

Kubernetes operators help to simplify, streamline, and automate application tasks beyond the conventional Kubernetes offerings. In this webinar, AWS Developer Advocate for Kubernetes, Lukonde Mwila, will delve into the remarkable capabilities of Kubernetes operators and how to leverage them in your applications. You’ll also learn how Lumigo built a Kubernetes operator for seamless distributed tracing leveraging OpenTelemetry. We will also demonstrate how our operator transforms complex processes into a single command, promising an unmatched user experience and exceptional app health insights.