
Latest News

Observability vs. Monitoring: How Do They Work?

As organizations increasingly depend on distributed system architectures to deliver modern applications and microservices, their legacy monitoring tools struggle to keep pace. These outdated tools are often built around predictable failures, so when an unforeseen performance issue occurs, it can lead to outages and unplanned downtime that affect your customers and your business.

Future Trends In Kubernetes Cost Management: What To Expect

Kubernetes has emerged as a pivotal force in shaping modern cloud infrastructure. Originating as a brainchild of Google, Kubernetes has evolved into an open-source platform that has revolutionized how applications are deployed, scaled, and managed across a vast network of machines. Its ability to orchestrate containerized applications efficiently makes it an indispensable tool within cloud computing. However, with great power comes great responsibility, particularly in the realm of cost management.

FireHydrant is now AI-powered for faster, smarter incidents

Over the last five years we’ve seen our customers run 583,954 incidents more efficiently thanks to a shared workspace, powerful Runbook automations, and auto-captured data. Yet despite a great deal of progress, incident efficiency hasn’t reached its peak potential. We talk to a lot of folks who are still stuck in the muck: new responders struggle to get up to speed quickly, incident commanders wade through post-incident drudgery, and knowledge silos prevent comprehensive improvements.

Fault Injection in your release automation

One of the real successes of the Agile software development movement has been the push for regular, frequent deployments. This has manifested as build and deployment automation and the general adoption of CI/CD. As engineers automate more of their software release lifecycle, an important question is how to automate Quality Assurance, which includes resilience testing and, more specifically, Fault Injection.

Linux CPU Utilization - How To Check Linux CPU Usage

CPU utilization is a crucial metric for measuring system performance and identifying potential bottlenecks in Linux systems. This article explores the concept of CPU utilization, factors contributing to high CPU usage, and various command-line tools and graphical utilities for monitoring and troubleshooting CPU utilization in Linux environments.

Argo CD and Codefresh GitOps Security Updates 3/18/2024 - Preventing Brute-Force and Denial of Service

In September 2023, security researchers from KTrust reported three issues through the official Argo CD security disclosure channels in accordance with Argo CD security policy. In coordination with other Argo maintainers, we have issued security updates for both Argo CD and Codefresh GitOps (enterprise Argo). Below you can read more about these CVEs, their impact, and mitigation.

Forward and reverse DNS lookups: What they are, why you need them, and how to configure them

Effectively managing the dynamics of domain name lookups through the DNS is crucial for boosting the speed and security of network connections. Forward and reverse DNS lookups, the yin and yang of network connections, translate human-friendly domain names into machine-readable IP addresses and vice versa, ensuring secure connections within both public and private networks.

Containerization and DevOps: Optimizing Deployment with Docker and Kubernetes

DevOps practices have revolutionized how teams build, test, deploy, and manage applications, enabling rapid delivery without compromising quality. Central to this paradigm shift are containerization technologies like Docker and orchestration platforms like Kubernetes. In this article, we’ll explore how containerization and DevOps intersect, and how leveraging Docker and Kubernetes can optimize deployment processes.

Strategies for Scaling Systems Reliably by Bob Lee

I was out in sunny Austin this February, speaking at Civo Navigate 2024. The event was jam-packed with amazing talks, and it was great meeting so many people with long and fascinating careers in engineering and Site Reliability. I had the privilege of meeting Bob Lee, who currently leads DevOps at Twingate, a cloud-based service that provides secure remote access and is poised to replace VPNs.