
PagerDuty Is for People: Supporting Our Community During COVID-19

Yesterday, we released our earnings during an unprecedented time for society and the market. One of the things I noticed was the collective empathy we experienced as we talked to different teams and companies in preparation, and in our analyst callbacks, where, to a person, everyone kicked off the call by wishing each other good health and safety. It reminded me that when we are all in this together, not only are great things possible, but the challenge also feels less daunting and more manageable.

Custom Alerts Using Prometheus Queries

Prometheus is an open-source system for monitoring and alerting originally developed at SoundCloud. It joined the Cloud Native Computing Foundation (CNCF) in 2016 and became one of its most popular projects after Kubernetes. It can monitor everything from an entire Linux server to a stand-alone web server, a database service or a single process. In Prometheus terminology, the things it monitors are called targets, and each measurement a target exposes is called a metric.

How SIGNL4 supports geolocation and GPS information

SIGNL4 supports geolocation information in multiple ways. When a new alert with geolocation information is displayed in the mobile app, the app renders a map to visualize the location of the incident. A double click opens the default map application on the mobile device, e.g. to get directions or traffic information.

How We Use PagerDuty for Emergency Response

PagerDuty is known as the platform for driving real-time work, and with the current global spread of COVID-19, many of our customers have been asking how we leverage PagerDuty internally to intelligently coordinate a response to emergency situations (such as this) as they arise. PagerDuty customers primarily leverage our platform for coordinating an incident response process when technical issues happen, such as a bad deployment, network degradation or failed hardware.

Announcing Ticketing

Incidents come up quickly. Between tracking the critical tasks to be done in the moment and those that remain after an incident is resolved, it can be challenging to keep up with who did what during an incident and which tasks still need to be completed. In an effort to continue simplifying your incident response process, today we are happy to announce an overhaul of ticketing and task tracking on FireHydrant, along with a major overhaul of our JIRA integration.