
Latest Videos

How do you build resilient systems to manage the IPL with 30+ million concurrent users?

The Indian Premier League is a unique sporting event for many reasons, and for engineers in India it is one of a kind. Very few companies can claim to manage 30+ million concurrent users, and the number grows every year: last year, we witnessed ~60 million concurrent users.

The Debrief: AI can help you never forget incident follow-up actions again

Noting follow-up actions at the end of the incident response process is important, but it is easy to overlook certain actions or forget them entirely. With Suggested Follow-ups, this is now a thing of the past. In this episode, you'll hear from Rob, the project lead for our latest Suggested Follow-ups feature, to get a peek behind the curtain.

How Complyt is using Datadog APM and distributed tracing to reduce application response times

Learn how Complyt is using Datadog Application Performance Monitoring (APM) and distributed tracing to turn data into knowledge and reduce application response times by more than 80%, which enabled them to meet SLAs for their largest customers.

SigNoz Launch Week - Day 4 - Logs Pipeline

For day 4, we will showcase our recent work on Logs Pipeline. With Log Pipelines, you can transform logs to suit your querying and aggregation needs before they are stored in the database. Pipelines provide a way to modify the structure and content of log data without changing application code or redeploying components. By extracting relevant attributes from logs, pipelines enable more efficient analysis.
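To make the idea of extracting attributes before storage concrete, here is a minimal, generic sketch in Python. It is not SigNoz's implementation or configuration format: the log line format, the `LINE_PATTERN` regex, and the `extract_attributes`, `convert_numeric`, and `run_pipeline` names are all assumptions made for illustration.

```python
import re
from typing import Any, Callable, Dict, List

LogRecord = Dict[str, Any]
Processor = Callable[[LogRecord], LogRecord]

# Assumed (hypothetical) log format:
#   "<LEVEL> <METHOD> <path> status=<code> duration_ms=<n>"
LINE_PATTERN = re.compile(
    r"(?P<level>[A-Z]+) (?P<method>[A-Z]+) (?P<path>\S+) "
    r"status=(?P<status>\d+) duration_ms=(?P<duration_ms>\d+)"
)


def extract_attributes(record: LogRecord) -> LogRecord:
    """Parse the raw body and copy named groups into 'attributes'."""
    match = LINE_PATTERN.search(str(record.get("body", "")))
    if match is None:
        return record  # leave records that do not match untouched
    attributes = dict(record.get("attributes", {}))
    attributes.update(match.groupdict())
    return {**record, "attributes": attributes}


def convert_numeric(record: LogRecord) -> LogRecord:
    """Turn numeric attributes into ints so they can be aggregated."""
    attributes = dict(record.get("attributes", {}))
    for key in ("status", "duration_ms"):
        if key in attributes:
            attributes[key] = int(attributes[key])
    return {**record, "attributes": attributes}


def run_pipeline(record: LogRecord, processors: List[Processor]) -> LogRecord:
    """Apply each processor in order; the application emitting logs is unchanged."""
    for processor in processors:
        record = processor(record)
    return record


PIPELINE: List[Processor] = [extract_attributes, convert_numeric]

raw = {"body": "INFO GET /api/orders status=200 duration_ms=37"}
processed = run_pipeline(raw, PIPELINE)
print(processed["attributes"])
```

Because the processors run on the log record after it leaves the application, changing what gets extracted only means editing the pipeline list, which mirrors the point above about not redeploying components.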

The Unplanned Show, Ep. 29: Major Incident Management with Davis and Chris

Not all incidents are created equal. How do you handle major incidents so that they don't spiral into a chaotic mess, incinerating productivity across too many teams? How do you prevent major incidents and learn from the ones you've had? "Major Incident Management" has been a practice for a long time, but as companies depend even more on digital services and revenue channels, while trying to do more with the same or less, something has to change.