Operations | Monitoring | ITSM | DevOps | Cloud

DevOps

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Beyond the Blue Screen: Insights from the Microsoft-CrowdStrike Incident

In the wake of the Microsoft-CrowdStrike incident on July 19, 2024, Squadcast community has been actively reflecting on the lessons learned from this disruptive event. This global outage, affecting 8.5 million Windows machines, has served as a critical case study for incident management and operational resilience.

The 6 Best Performance Testing Tools

In software development, load testing plays a critical role in ensuring that applications perform optimally under any imaginable load condition. To do this, developers subject applications to several types of load tests, including scalability, spike, endurance, and stress testing. The ultimate goal of these performance tests is to pinpoint potential bottlenecks and ensure the reliability of the overall system where the software application runs before reaching production.

Debugging your Rancher Kubernetes Cluster the GenAI Way with k8sgpt, Ollama & Rancher Desktop

The advancements in GenAI technology are creating a significant impact across domains/sectors, and the Kubernetes ecosystem is no exception. Numerous interesting GenAI projects and products have emerged aimed at enhancing the efficiency of Kubernetes cluster creation and management. From simplifying application containerization for engineers to addressing complex Kubernetes-related queries or troubleshooting issues within a cluster, GenAI demonstrates immense potential.

How to verify, document, and prove compliance with Gremlin

Resilient and reliable IT systems have become a minimum requirement for modern businesses—a fact driven home by any number of high-profile outages over the past few years. Unfortunately, when those outages are in the financial sector, it can have far-reaching and incredibly damaging results.

Day-0, Day-1, and Day-2 Operations: What Are the Differences?

Operations are the backbone of successful software delivery, but the specifics of each phase—Day-0, Day-1, and Day-2—often get overlooked. Understanding these phases can help you streamline deployments, reduce risks, and maintain robust, scalable systems. Let’s break down what each phase entails and explore their distinct activities, tools, and best practices.

MegaPod | Guilherme Soubihe, CEO of Latitude.sh

In this conversation, Michael Reid, CEO of Megaport, interviews Gui Soubihe, CEO and founder of Latitude, a company that offers Bare Metal and GPU as a Service. They discuss the growth of AI and the demand for GPU resources, as well as the challenges and opportunities in the industry. Gui shares how Latitude provides a full stack solution, combining software and infrastructure to offer a seamless user experience. They also talk about the benefits of using Megaport's connectivity services to access Latitude's GPU platform.