
November 2021

PagerDuty at AWS re:Invent 2021 - Deepening Our Collaboration with AWS

Across the globe, in-person technology events are beginning to emerge from their pandemic hibernation. For developers and DevOps teams, no event has been more anticipated than AWS re:Invent, which returns to Las Vegas from November 29th to December 3rd to help bring us all back together as we slowly find our new normal. While handshakes may be replaced by elbow bumps or other newfound greeting rituals, we are excited to be back and see all of you in real life.

Partner Integration - Dynatrace with PagerDuty and Rundeck

Deliver perfect software experiences with real-time intelligence into customer satisfaction and behavior, your applications, and the performance of your hybrid multi-cloud. AI-powered root-cause analysis automatically identifies customer-facing performance issues and pinpoints the root cause within seconds. Open APIs allow ingestion of third-party metrics and enable complex system integrations. In this demo, Rob Jahn shares a sophisticated incident remediation workflow incorporating intelligence from Dynatrace, automation in Rundeck, and incidents in PagerDuty.

4 Ways To Ensure Reliability of Your Digital Services for GivingTuesday

In today’s digital economy, seconds matter. For mission-driven organizations, seconds can be a matter of life and death, and service reliability can make or break access to suicide and safety hotlines, disaster relief, time-critical health care, food assistance, and more. That’s where real-time digital operations comes in.

Training Intelligent Alert Grouping

Complex incidents are both exhausting and commonplace. By “complex,” I mean incidents that involve multiple, disparate notifications in your alert management platform. Perhaps these incidents are logically separated because the underlying systems or services were seen as less coupled than they turned out to be in reality.

Fall 2021 Launch: Automate Incident Response to Accelerate Critical Work

Modern businesses are digital businesses—so managing your business means mastering your critical services and operations for your employees and customers. Today, you need to be able to understand every aspect of your company—as it unfolds—because in this world, seconds matter to your productivity, your revenue, and most importantly, your customers.

Partner Integration on Twitch: Lacework

Lacework delivers complete security and compliance for the cloud. While the cloud enables enterprises to automatically scale workloads, deploy faster, and build freely, it also makes it increasingly difficult to maintain visibility, remain compliant, stay free from known vulnerabilities, and track activity in both host workloads and ephemeral infrastructure within their environments. Integrate Lacework with PagerDuty to route Lacework Events to responders on your team. Manage and resolve configuration issues, behavioral anomalies, and compliance requirements in a timely manner across your cloud infrastructure.

Monitoring & Observability for Sales, Marketing and Business ops teams with Stack Moxie and PagerDuty

Before Stack Moxie, every business ops team needed PagerDuty, but finding and pushing errors was a manual process. With Stack Moxie + PagerDuty, every business ops professional can manage their sales, marketing, HR or customer success stack with the same quality that engineers bring to code.

New Tech Leader Survey Reveals Why the Time for Real-Time Operations is Now

“Customer obsessed.” “Customer-centric.” “Customer-first.” For CEOs everywhere, setting and maintaining a coordinated focus on the customer has become a top priority when driving innovation. After all, for many organizations regardless of industry, digital customer experiences are what can make or break the bottom line.

New Apps for PagerDuty's Datadog Integration

Status Dashboard by PagerDuty and Incidents by PagerDuty are new apps available now in Datadog. See a live, shared view of system health to improve awareness of operational issues with Status Dashboard by PagerDuty. Acknowledge, troubleshoot, and resolve incidents with PagerDuty actions embedded directly in the Datadog interface to limit context switching among tools. Julia Nasser and Hadijah Creary join the stream to show off this powerful enhanced integration.

Make sense of complex systems with Dynamic Service Graph by PagerDuty

The Dynamic Service Graph breaks down silos between teams and provides organizations with a living, breathing asset that displays technical and business services and their relationships at scale. It allows teams to quickly grasp the state of services, visually digest the full impact radius of an issue, zero in on the likely cause, and seamlessly facilitate cross-team collaboration.

Improve Your Automation and Reduce Toil

In the course of your day as an SRE, DevOps engineer, or sysadmin, your knowledge and expertise are in high demand. You can’t do every task every person in your org needs from you without the help of comprehensive automation. But automation can be tricky. Some systems aren’t built with automation in mind; they assume that a human being will be there to keep an eye on things and fix errors on the fly, and we can’t be everywhere when there’s too much to do. Plus, you want to provide access to automation for the right folks and keep a record of when the tools were used.

Leaning on Technology in The New Noisy: Managing Cloud, Change and Risk

Your company’s “digital transformation” will be driven by new application designs and methods, new technology stacks, and new processes. To master it, and to deliver next-generation services through it, massively complex sets of signals and data need to be leveraged, processed, and acted on. Developers need integrated data and insights that cut through that noise, while still being able to use their tools of choice. All of this must be managed despite massive rates of change and innovation.

Visualize and manage all of your services in one place with Dynamic Service Graph

In this digital era, technology systems are becoming increasingly complex. No longer can a single SME (subject matter expert) understand every facet of the system they run. Instead, much of this knowledge is siloed and exists as tribal knowledge within certain teams. Additionally, the rate of change is faster than ever, with code deploying and new services shipping at a rate unimaginable a few years ago.

What's New in the PagerDuty Terraform Provider - PagerDuty Garage (Oct 29, 2021)

The Terraform PagerDuty provider is a plugin for Terraform that allows for the management of PagerDuty resources using HCL (HashiCorp Configuration Language). Manage your PagerDuty account with Infrastructure as Code. For more info on the PagerDuty provider for Terraform, see the documentation on the Terraform Registry.

How service ownership can help you grow your operational maturity

Digital operations management is about harnessing the power of data to act when it matters the most. It’s also about having the right processes and procedures to support teams when every second is critical. Maturing your digital operations takes time, iteration, and commitment. The change won’t happen overnight. But if you put in the effort, you’ll reap outsized benefits. You’ll be able to learn from incidents and proactively improve your services over time.