Operations | Monitoring | ITSM | DevOps | Cloud

Embracing failure and chaos to improve system reliability and SRE team performance

In this interview with Alex Hidalgo, Field CTO at Nobl9 and author of Implementing Service Level Objectives (O’Reilly Media), we explore how traditional metrics like MTTR and MTTx can give a false sense of reliability. Alex shares how SRE teams can embrace failure, build psychological safety, and design systems that reflect the human factor behind uptime, outages, and real-world reliability.

Monitor OCI spend, AI in DDSQL Editor, OTLP Metrics API, and more | This Month in Datadog

See how you can gain insights into cloud costs by tracking OCI spend and easily comparing instance types in October’s episode of This Month in Datadog. Join us for a spotlight of Cloud Cost Management’s support for Oracle Cloud Infrastructure, and the product’s new feature, Instance Explorer, which enables you to visualize and easily compare the cost and performance of instances across AWS, Azure, and Google Cloud.

AWS & Splunk: Accelerating Innovation Through Partnership

Discover how AWS and Splunk are pushing the boundaries of innovation to empower your security, observability, and cloud transformation journey. This video highlights our joint commitment to driving digital resilience through unified visibility, faster threat detection, and seamless integration across AWS services.

Why AI Coding Assistants Fail (And How to Fix Them)

Why do developers stop using AI coding assistants? According to Carnegie Mellon research, the top reason is unhelpful suggestions. Tabnine's Principal Architect John Feeney explains how context transforms AI coding tools from generic to genuinely useful. Learn the 4 Cs framework for maximizing AI assistant value: Context (workspace indexing), Connection (repo integration), Coaching (rules-based guidance), and Customization (fine-tuning). Discover how Retrieval Augmented Generation (RAG) helps AI understand your codebase, not just open source patterns.

Rovo AI: Create Work Items from Loom | Demo Den | Atlassian

Ever wish you could turn a quick Loom recording into Jira work items without all the manual typing? Now you can! In this Demo Den episode, Pierre walks through a new Rovo AI feature that automatically converts your Loom videos into actionable Jira work items. Whether you're recording bug reports, feature requests, or project updates, Rovo handles the data entry for you. What Pierre covers: Turning Loom videos into work items with Rovo How it works in your AI-enabled Jira instance.

Discover resources smarter with deep discovery in internet services

Discover how Deep Discovery from Site24x7 simplifies your website monitoring by automatically detecting, grouping, and managing all related resources—so you don’t miss a thing. In this video, we walk you through a real-world use case, the problems Site24x7 solves, and how its time-saving features like Bulk Addition make managing multiple monitors effortless. Whether you’re tracking SSL, DNSs, APIs, or website performance, Deep Discovery gives you complete visibility without manual hassle.

The one where we scaled

From 3 people in 2020 to 93 in 2025—incident.io has come a long way, and we’re just getting started. Whether you’ve been here since the early days or just joined, this is what it looks like to build something great *together*. If you're after:️️ Great people Real impact (across the globe, not just in Greece) A place where growth is the default And teammates who’ll always be there for you... We’re hiring! (And we're going to need a bigger couch…)