
The latest news and information on incident management, on-call, incident response, and related technologies.

Deploying to production in <5m with our hosted container builder

Nov 18, 2021 By Lawrence Jones In Incident.io

Fast build times are great, which is why we aim for less than 5m between merging a PR and getting it into production. Not only is waiting on builds a waste of developer time — and an annoying concentration breaker — the speed at which you can deploy new changes has an impact on your shipping velocity. Put simply, you can ship faster and with more confidence when deploying a follow-up fix is a simple, quick change.

Training Intelligent Alert Grouping

Nov 18, 2021 By Quintessence Anx In PagerDuty

Complex incidents are both exhausting and commonplace. By “complex,” I mean incidents that involve multiple, disparate notifications in your alert management platform. Perhaps these incidents are logically separated because the underlying systems or services were seen as less coupled than they turned out to be in reality.

How to Use Status Page to Deliver Bad News to Customers

Nov 17, 2021 By StatusHub In StatusHub

In this article, we’re exploring how status pages can help you deliver bad news to customers in a “good way,” starting with the psychology of news delivery and how you can use this knowledge for future incidents.

Fail-Safe Digital Scheduler for On-Call Management

Nov 17, 2021 By OnPage In OnPage

In this video, we discuss how OnPage's advanced, fail-proof digital schedules enable organizations to distribute workload evenly among scheduled on-call team members. The OnPage scheduler starts out "FULL" and schedules are created on top of it. This guarantees that a notification is delivered reliably, even when a slot is left empty on the scheduler. In that case, the scheduler reverts to the default group order and the entire group is notified, ensuring continuous coverage across your organization.

Viewing Your Contacts on Android - xMatters Support

Nov 17, 2021 By xMatters In xMatters

Join Chris Patch, xMatters’ Senior eLearning Specialist, as he navigates you through the “My Contacts” section of the xMatters app for Android devices.

Tis The Season: Protect Your Availability During The Holidays

Nov 17, 2021 By Richard Whitehead In Moogsoft

Deck the halls! It's time for the annual holiday Code Freeze, that festive time of year when businesses impose a precautionary halt to code changes and Operations should be quiet. But before you kick up your feet, make sure that demand doesn’t lead to availability embarrassments. After all, retail experts suggest that we’re in for another online-heavy holiday shopping season, so businesses need to brace for increased digital traffic...with little tolerance for failure.

Partner Integration on Twitch: Lacework

Nov 16, 2021 By PagerDuty In PagerDuty

Lacework delivers complete security and compliance for the cloud. While the cloud enables enterprises to automatically scale workloads, deploy faster, and build freely, it also makes it increasingly difficult to maintain visibility, remain compliant, stay free from known vulnerabilities, and track activity in both host workloads and ephemeral infrastructure within their environments. Integrate Lacework with PagerDuty to route Lacework Events to responders on your team. Manage and resolve configuration issues, behavioral anomalies, and compliance requirements in a timely manner across your cloud infrastructure.

5 ways incidents made me a better engineer

Nov 16, 2021 By Lisa Karlin Curtis In Incident.io

Incidents are a great opportunity to gather both context and skill. They take people out of their day-to-day roles and force ephemeral teams to solve unexpected and challenging problems. In my career, I've found incidents can be a great accelerator, both for me and for others around me. It was after leading my first incident at GoCardless that I started to feel really comfortable in the codebase and the team.

Fall 2021 Launch: Automate Incident Response to Accelerate Critical Work

Nov 16, 2021 By PagerDuty In PagerDuty

Modern businesses are digital businesses—so managing your business means mastering your critical services and operations for your employees and customers. Today, you need to be able to understand every aspect of your company—as it unfolds—because in this world, seconds matter to your productivity, your revenue, and most importantly, your customers.

IT Failures are Inevitable

Nov 15, 2021 By xMatters In xMatters

As infrastructure stacks grow increasingly complex and involve an ever-growing number of services, system failures are becoming more and more common. Systems fail for a variety of reasons: software bugs, misconfiguration or interactions between services that cause unexpected behavior, network outages, and, of course, those rare occasions where natural events render data centers inoperative.
