The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Automating Incident Callouts for Canadian Pacific's Engineering Team

Canadian Pacific (CP) is a historic Canadian Class I railroad incorporated in 1881. It was CP that connected the country and became Canada’s first transcontinental railway. Headquartered in Calgary, Alberta, it owns approximately 13,000 miles of track across Canada and the United States. Canadian Pacific initially introduced Enterprise Alert in 2016 to increase the speed and effectiveness of incident callouts to information workers and staff in various departments.

Curb alert noise for better productivity: How-Tos and Best Practices

In the quest to provide the best uptime, software platforms depend on complex, interconnected microservices. This often leaves them vulnerable to cascading failures that create a massive deluge of alerts from monitoring tools when things go wrong. In this blog, we explore how Squadcast can be configured to curb alert noise for better productivity with the help of the most advanced deduplication features.

How to create a custom ServiceNow incident report dashboard in Canvas

Welcome back once again! This is the third and final part of this series on using the Elastic Stack with ServiceNow for incident management. In the first blog, we introduced the project and set up ServiceNow so changes to an incident are automatically pushed back to Elasticsearch. In the second blog, we implemented the logic to glue ServiceNow and Elasticsearch together through alerts and transforms as well as some general Elasticsearch configuration.

Public Team Calendars

Today, we are excited to announce PagerTree has added support for public calendars! Public calendars allow you to share a team’s on-call calendar with the rest of the world. Public calendars are available on our Pro and Elite pricing plans. If you don’t already have an account, sign up for a free trial now. By default, all calendars are private, so to make use of this feature you must enable it.

An introduction to Mattermost as your DevOps Command Center

Mattermost is a platform based on collaboration — not built simply for facilitating team and asynchronous communication, but built on the philosophy that having the ability to collaborate efficiently makes the world safer and more productive for everyone. This is true in many day-to-day situations in an organization, but it is especially true in the world of DevOps. When an emergency arises, information needs to be moved from person to person and team to team as quickly as possible.

How Expedia modernized operations on one of the world's fastest-moving IT stacks

It’s not every day we are given a chance to get a first-hand look at how one of today’s leading and most advanced enterprises operates its IT stack. That’s why we were very excited when three senior IT executives from Expedia accepted our invitation to participate in a webinar discussing the company’s IT modernization journey.

Build Organizational Trust With PagerDuty Business Response

Imagine the following scenario: A large retailer experiences a major IT incident that impacts their point-of-sale systems. Their on-call engineers are alerted to the issue and begin their work to resolve it immediately. Behind the scenes, teams are collaborating on a fix, but in the storefront, frustration and tension are growing. Customers are complaining about not being able to check out, and in-store personnel have no good answers as to why the outage happened—or when it will be resolved.