Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Argo Rollouts Canary Monitoring: Metrics, Gotchas, and Automated Gates with Last9

Argo Rollouts exposes Prometheus metrics on port 8090 — but the docs lie about which labels exist. Here's how to scrape them into Last9, build a canary dashboard, and use Last9 as an automated AnalysisTemplate gate, including the auth and base64 gotchas. Prathamesh works as an evangelist at Last9, runs SRE stories - where SRE and DevOps folks share their stories, and maintains o11y.wiki - a glossary of all terms related to observability.

All You Need to Know About CrashLoopBackOff Error

Kubernetes is an open-source container orchestration engine that is used to automate containerized application deployment, scaling, and administration. It is an open-source management platform that can be used to manage containerized workloads and services, as well as declarative configuration and automation. Kubernetes is a framework for running distributed systems in a resilient manner. It handles scaling and failover for your application and provides deployment patterns and other features.

Eliminate Manual Authentication Configuration for Fast & Effective API Security Scanning | Harness Blog

Application security testing tools promise coverage and accuracy, but teams often struggle just to get started. One of the biggest friction points in dynamic application security testing is configuring authentication correctly so a scanner can even access a target application, let alone API endpoints that power the functionality. Whether it’s API keys, bearer tokens, or custom auth flows, setting up authentication for scans frequently requires trial-and-error and engineering support.

What is AI SRE? The Complete Guide to AI-Assisted Site Reliability Engineering

It's 2:47 AM. PagerDuty fires. You open a Slack alert and see: p99 latency spike on checkout-service. You SSH into the host, check dashboards in four tabs, grep logs for the last 20 minutes, and eventually find a slow query introduced in a deploy six hours ago. It took 34 minutes. You resolved it, w Prathamesh works as an evangelist at Last9, runs SRE stories - where SRE and DevOps folks share their stories, and maintains o11y.wiki - a glossary of all terms related to observability.

Capturing HTTP Request and Response Bodies in .NET Traces with PHI Redaction

> Standard OTel.NET instrumentation captures headers, status codes, and timing — not request or response bodies. Here's how to add body capture to your traces while keeping PHI out of your observability backend. Prathamesh works as an evangelist at Last9, runs SRE stories - where SRE and DevOps folks share their stories, and maintains o11y.wiki - a glossary of all terms related to observability.

Kubex Named a 2026 Leader by GigaOm

Industry analyst recognition means something different from an award. GigaOm does not hand out trophies. They evaluate products against a defined capability framework and tell the market where vendors actually stand. By that measure, Kubex has been named a Leader in two of GigaOm’s 2026 Radar Reports: Kubernetes Resource Management and Cloud Resource Optimization. In the Kubernetes report, we are positioned as an Outperformer. In Cloud Resource Optimization, a Fast Mover.