Background We recently released the biggest overhaul to one of the core features of Spike.sh - On-call schedules. Software teams use on-call schedules to designate first responders who will handle issues when they occur.
On-call planning is one of the most popular features in Enterprise Alert and is widely used by users, team managers and administrators. However, in our discussions we keep finding that it is not simply done with 5 minutes of planning. Scheduling often depend on external systems. This can range from a simple excel form provided to HR all the way to a comprehensive billing system such as SAP. As a result, it takes a quite a bit of time to transfer the planned shifts to third-party systems.
We’re excited to present a feature update to the OnPage platform. The new update will bring more flexibility and resiliency to a team’s on-call management workflow. With the new scheduling capabilities, OnPage system administrators can create exceptions to configured, recurring on-call schedules.
An on-call schedule tells you and everyone in the team who will be the first responder when an issue happens in production. The on-call team member is responsible for investigating the issue, either fixing the issue herself or adding other people who can help fix it. Having an on-call schedule is important for building reliable systems because making someone responsible for production issues makes sure that they're not ignored.
The always-on, always-available expectations of digital services have increased the requirements of technical teams to be ready and provide response around the clock. For teams new to this concept, introducing on-call can be stressful and complex. As part of PagerDuty’s main platform, on-call management is key to our business, but the non-technical aspects are also important for teams to consider.
We’re excited to announce a new set of product updates and enhancements to the PagerDuty platform! Our latest release expands Change Impact Mapping integrations and experiences, gives access to the Visibility dashboard for Business plans, improves on-call processes and analytics, and advances incident response automation so teams can work more efficiently during the moments that matter.
Today, more than ever, mobilizing remote teams to triage and resolve outages separates is separating enterprises able to accelerate their digital initiatives from those who don’t. Observability has elevated our ability to quickly detect problems and ask questions in our system to triage and reduce “time to clue” — an increasingly important metric.
If you are still handing over a shared on-call duty phone or pager (sometimes called ‘operations phone’), it is time to rethink your process. The Covid19-induced new normal has a dramatic impact on our work live and social behavior. We work from home and that is especially true for the IT workforce. We meet with less people and limit our social network to relatives and close friends.