Operations | Monitoring | ITSM | DevOps | Cloud

Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

PagerDuty Launches New Innovations to Reduce Tool Sprawl and Optimize Operations

The number of tools used by distributed teams to manage incidents has multiplied over the years, leading to a valley of tool sprawl. Throw in manual processes and you’ve got too much toil and multiple points of failure. Maintaining disparate tools and systems isn’t just unwieldy, it’s expensive. Our latest capabilities add to the PagerDuty Operations Cloud to make it easier than ever for teams to consolidate their incident management stack.

Our social resurgence: activating our social media presence to revamp Incident Management

Over the past year, Spike.sh social media activity has been null. As a bunch of shy nerds in a small team working remotely across the world, we really never bothered with social media and our presence on it. We always kept our heads low and maneuvered around it. But no more. As of today, we are coming back on social media channels like LinkedIn, Twitter, and Reddit as well.

Learn How PagerDuty Customers Save Money and Achieve Fast ROI

Saving time and money is always important, but these days, it’s a mission-critical business imperative. At PagerDuty, we help organizations realize transformational gains in efficiency that drive both immediate financial impact and long-term business success. PagerDuty delivers clear value for any organization at any stage of operational maturity. But you don’t have to take our word for it – the real-life experiences of our customers speak volumes.

How Helpdesks Facilitate Major Incident Management

Helpdesks serve as the initial line of defense for IT incidents, responsible for facilitating incident management, including logging, categorizing, and prioritizing incidents. In the event of a major incident, the helpdesk plays a crucial role in escalating the incident to the appropriate major incident management (MIM) team. The success of this process relies on the expertise of the helpdesk staff in providing situational context to expedite resolution.

Building A DevTools Saas Company Today! Incidentally Reliable Podcast | Zenduty

Catch Rajesh Tilwani talking about Building a DevTools SaaS Company, and everything reliability only on the Incidentally Reliable Podcast, live now on all major platforms! About Zenduty: Zenduty is a revolutionary incident management platform that gives you greater control and automation over the incident management lifecycle. With the Zenduty API, you can supplement and deploy Zenduty in sync with other tools and services, allowing you to create and update incidents, users, teams, services, integrations, schedules etc. and automate your workflows using simple scripts.

The fastest and most robust path to incident declaration from monitoring tools

Here’s a crazy question: why do we still require a human to manually declare an incident for the things that we know are incidents? If we have enough confidence to build SLOs and high-severity alert routes for these specific scenarios, why are we still asking a human to confirm it’s an incident and get the assembly process in motion? Isn’t that just another button to push when we could be problem solving instead?

Establishing Zero Trust out of the box at Enterprise scale

At most enterprises CIOs are already multiple waves into enforcing Zero Trust policy across their processes, configurations and teams. As a DevOps Lead, being responsible for juggling user empowerment and adherence to your executive’s policy across many SaaS tools can be tricky. This problem is especially challenging in incident management where highly sensitive data is being shared, incidents rely on multiple different types of team members, and response teams fluctuate from incident to incident.

Insights into Observability Tools: Commercial vs. Open-Source

Observability has become a critical aspect of modern software development and operations, allowing organizations to gain insights into the health and performance of their applications and systems. One of the key decisions when implementing observability is choosing between commercial or open-source tools. We spoke to several professionals who shared their experiences and insights on this topic, shedding light on the pros and cons of each approach.

Process Automation v4.12.0 and v4.13.0 Release Notes

Product Managers Jake Cohen and Forrest Evans are back to update us on what’s new in the 4.12.0 and 4.13.0 releases of PagerDuty Process Automation. New in these releases are features to support #Kubernetes automation, managing resources in multiple #AWS accounts, and a new plugin suite for Sensu.