Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Containers, Kubernetes, Docker and related technologies.

10 Most Common Kubernetes Reliability Risks

Reliability risks are potential points of failure in your system where an outage could occur. If you can find and remediate reliability risks, then you can prevent incidents before they happen. In complex Kubernetes systems, these reliability risks can take a wide variety of forms, including node failures, pod or container crashes, missing autoscaling rules, misconfigured load balancing or application gateway rules, pod crash loops, and more. And they’re more prevalent than you might think.

Top 4 Crossplane Alternatives & Competitors

The evolution of cloud infrastructure management has been significantly influenced by the development of Infrastructure as Code (IaC) tools, among which Crossplane stands out as a pioneering solution. CrossPlane, an open-source project, revolutionizes how developers manage and orchestrate cloud services by extending Kubernetes with powerful abstractions for multi-cloud environments.

What is Kubernetes Pod QoS?

Container orchestration, Kubernetes has emerged as a leading platform for managing and deploying containerized applications. One fundamental concept that plays a crucial role in ensuring optimal performance is the Quality of Service (QoS). In the realm of Kubernetes, this concept is applied at the level of Pods, forming the backbone of resource management within the cluster.

Resource Constraints in Kubernetes and Security

The Sysdig 2024 Cloud‑Native Security and Usage Report highlights the evolving threat landscape, but more importantly, as the adoption of cloud-native technologies such as container and Kubernetes continue to increase, not all organizations are following best practices. This is ultimately handing attackers an advantage when it comes to exploiting containers for resource utilization in operations such as Kubernetes.

Datadog on Kubernetes Autoscaling

Datadog, the observability platform used by thousands of companies, runs on dozens of self-managed Kubernetes clusters in a multi-cloud environment, adding up to tens of thousands of nodes, or hundreds of thousands of pods. Also, this infrastructure is used by a wide variety of engineering teams at Datadog, with different features and capacity needs that may also change overtime.

Service mesh and ingress controllers: Bringing the outside world in

The first problem that any cloud-native application has to solve is how to communicate with the world outside the cluster. This is “the ingress problem”, and while service meshes don't have to solve it directly, it is absolutely a major part of successfully getting your application working with one! Join us for a whirlwind tour of how service meshes interact with ingress controllers using the Linkerd service mesh with Emissary-ingress, NGINX, and Envoy Gateway.

Announcing Longhorn 1.6.0

The Longhorn team is excited to announce the latest minor release, version 1.6.0! This release introduces several features, enhancements, and bug fixes that are intended to improve system quality and the overall user experience. Specifically, this release includes a further feature preview of the highly anticipated Longhorn Data Engine Version 2.0, platform-agnostic deployment, node maintenance, and improvements to stability, performance, and resilience.