Operations | Monitoring | ITSM | DevOps | Cloud

Unlocking the Value of your Runbook Automation Value Metrics with Snowflake, Jupyter Notebooks, and Python

This blog was co-authored by Justyn Roberts, Senior Solutions Consultant, PagerDuty Automation has become an integral piece in business practices of the modern organization. Oftentimes when folks hear “automation,” they think of it as a means to remove the manual aspect of the work and speed up the process; however, what lacks the spotlight is the value and return automation can offer to an organization, a team, or even just one specific process.

Simplify customer support with Datadog's integrations for Zendesk

Zendesk provides support teams with an integrated solution for processing all types of customer inquiries and feedback. But as organizations scale, support tickets multiply, making it increasingly difficult to parse all of your customers’ feedback and time-consuming to investigate issues. Customers often report issues without providing the detailed context needed for troubleshooting, creating unclear and indirect paths to remediation.

Home Assistant Tutorial: A Beginner's Guide to Automation

In this post, we’ll be taking a closer look at Home Assistant, an open source platform for connecting your smart devices at home. We’ll walk through every important section of Home Assistant: dashboards, integrations, add-ons, devices and entities, automation, scripts, and scenes. In addition, we’ll be walking through how to set up your Home Assistant and create automation using Home Assistant’s graphical user interface.

Escaping the Cost/Visibility Tradeoff in Observability Platforms

For developers, understanding the performance of shipped code is crucial. Through the last decade, a tablestake function in software monitoring and observability solutions has been to save and track app metrics. Engineers love tools that get out of your way and just work, and the appeal of today’s best-in-class application performance monitoring (APM) suites lies in a seamless day zero experience with drop-in agent installs, button click integrations, and immediate metrics collection.

Rancher Live: What's the buzz with Cilium?

The Cilium community has had some truly buzzworthy accomplishments (pun intended!) in the past year - from hosting the first ever CiliumCon in Amsterdam to becoming a CNCF graduated project! In this first episode of Rancher Live for 2024, we will be joined by the community pollinator for Isovalent, Bill Mulligan. Together, we will be diving into the how-tos of creating "hive"-ly orchestrated container workloads that are as sweet as honey!

Teams Call Quality Dashboard: The First Step In Teams Insight

Do you have much experience using the Call Quality Dashboard (CQD)? Does your team go on about it being a ‘good starting point’? Do you even know what the CQD does? Fear not. If you answered ‘no’ to any of those questions, we’re going to fill you in with all the details that matter and give you some additional direction on how to get the most out of them. The bottom line is they’re a good starting point, but a long way from being a proactive performance solution.

Paving the Road for Proactive Reliability

At Expedia Group, Kaushik Patel and Nikos Katirtzis have thousands of engineers and micro-services. Heterogeneity in terms of infrastructure and technologies used over the years created inefficiencies and posed the need for a set of automated best practices for our engineering teams. Over the past 2 years, using a data-driven approach, we’ve worked on creating a set of platforms that helps teams to adopt good reliability practices, including chaos engineering, release safety, or automatic failover between cloud regions. In this talk Kaushik and Nikos will cover the platforms they’ve built, including how they used data to drive their investment decisions.