Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Visualize Logs Alongside Metrics: A Complete Guide for Monitoring Slow MySQL Queries

When a service slows down, metrics will tell you that it’s happening but logs tell you why. For MySQL, slow queries can be a silent performance killer, gradually chewing through resources until users start complaining. By enabling MySQL’s slow query log and forwarding it to Loki (via Promtail), you can visualize query-level details right alongside your metrics on Grafana dashboards. This makes it easy to correlate what is slow (metrics) with what is causing the slowdown (logs).

QA Testing in 2025: Revolutionize Your Workflow with Preview Environments

Software quality assurance has changed dramatically over the past few years. Today, the velocity of software development demands more than traditional staging and shared QA environments. Releases are expected to be faster, integration cycles shorter, and quality standards higher. These pressures have inspired a growing interest in preview environments—ephemeral, production-like spaces spun up on demand for testing code changes in isolation.

LTS vs. upgrades: which future are you building for?

How should businesses decide between sticking to an LTS release or moving to a continuous upgrade model? In this episode, we explore the trade-offs, from stability and security to innovation and agility, and why flexibility in your upgrade policy is key to long-term success. We break down when LTS makes sense, when frequent upgrades deliver the most value, and how to balance both to keep your business secure, stable, and ready for what’s next.

Fiber Paths and Failsafes: Why Your Network Design Matters

Redundancy isn’t just a buzzword – it’s the design principle keeping modern AI and cloud applications online. In this Uplink episode, Kevin Schlosser, Interconnection Product Manager at NTT Global Data Centers, explains how resilient infrastructure is engineered to expect failure but remain operational. We explore: Diverse entry points and fiber path management AI-driven bandwidth growth: 100G standard, 400G emerging Cooling innovations for intense compute workloads Why providers without their own fiber may offer the most resilient paths.

Mike Long and DORA Community Discussion - Software Delivery Governance

Manual governance in regulated industries is like steering a ship with last year’s map. Approvals, ticket queues, and after-the-fact evidence collection slow delivery and increase risk. By the time an audit arrives, teams are scrambling to prove they followed the process. Watch Kosli’s Mike join Nathen Harvey at DORA to unpack why this happens — and what continuous, automated governance can do to fix it.

Cortex MCP set up

Learn how to set up the Cortex MCP in under 5 minutes. The MCP integrates directly into your IDE, giving instant access to Cortex data without leaving your coding environment. It reduces context switching by enabling natural questions about services and teams, and streamlines workflows with real-time data from Cortex, Jira, GitHub, and more.

Using Claude to power up your onboarding

I joined incident.io about ten weeks ago, having been in my previous role for four and a half years. Being a new starter was an unusual feeling for me, and there's been a huge amount to learn; but by lunch on my second day (!) I had started shipping value to our customers. A large part of hitting the ground running has been having a colleague alongside me, who I can pester with questions, who doesn’t get offended when I write in all capitals, and often praises me for being absolutely right!

Zero-downtime deployment with Flagsmith and CircleCI

As developers, we continually strive to improve our software. This often means rolling out new software features at a rapid pace. However, deploying new features to production is not without risk. From no real production testing to limited rollback options, traditional deployment can quickly become frustrating. The worst issues, though, usually stem from one thing: buggy features making their way into the hands of users.