Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Managing Squadcast resources with our expanded Terraform provider

Hey folks! We’re excited to announce that we’ve vastly expanded the capabilities of our Terraform provider. Previously, our Terraform provider was limited to creating and managing services as a resource. We have now covered the entire spectrum of resources available on Squadcast right from creating and managing users, escalation policies and also managing SLO’s via our Terraform provider. What does that mean for you?

When Can A Service Not Be a Service?

If you’re familiar with PagerDuty, you probably associate it with alerts about technical services behaving in ways they shouldn’t. Maybe you yourself have been notified at some point that a service wasn’t available, was responding slowly, or was returning incorrect information. That’s the common use of a service in the PagerDuty platform.

Intro to Grafana Incident

In this video, you’ll learn how Grafana Incident offers a complete incident management process out of the box in Grafana Cloud, so you can save time and focus on what’s important when things go wrong. Grafana Incident is available to all free and paid Grafana Cloud users. If you’re not already using Grafana Cloud — the easiest way to get started with observability — sign up now for a free 14-day trial of Grafana Cloud Pro, with unlimited metrics, logs, traces, and users, long-term retention, and premium team collaboration features.

How to Run a Post-Mortem Meeting: Tips, Tricks & Checklist

Meetings are a necessary evil in any workplace. They can be long, tedious, and often unproductive. But post-mortem (PM) meetings are different. They are one of the most valuable meetings a service-oriented organization can have. Post-mortem meetings are an essential part of any project manager's toolkit. They provide an opportunity to reflect on what went well and what could be improved upon in future projects.

PagerDuty Apps for AWS + Automated Diagnostics Demo

Reduce downtime and customer impact with service ownership while enabling teams to drive continuous improvement and innovation Learn about how you can modernize and optimize your operations with our enterprise-grade set of AWS integrations. Automate incident response with PagerDuty’s Runbook Automation and learn about our new set of AWS plugins and prebuilt jobs that make it easier to get up and running with auto-diagnostics.

Upgrade your shopfloor alerting with Derdack

Over the last couple of months and service releases, we made continuous efforts to enhance Derdacks capabilities to collect, aggregate and alert shopfloor incidents for our Industry customers that primarily use OPC for alerting. In the accompanying projects, we made big improvements to our OPC Integration even added additional features. The OPC integration received a complete overhaul of the configuration and data management systems and can now handle OPC UA Alerts&Conditions.