Operations | Monitoring | ITSM | DevOps | Cloud

PagerDuty

See You At The RSA Conference!

At PagerDuty, we’re counting down the days until the RSA Conference! Why? Because, in addition to being excited to see everyone there, we also have lots of new information to share—information in line with this year’s conference theme: Better. More specifically, how to improve security at your organization by having better processes and better collaboration.

Chaos Engineering With Ana Medina

Recently, I sat down with Ana Medina of Gremlin for a PagerDuty Community AMA! Ana is currently working as a Chaos Engineer at Gremlin, helping companies avoid outages by running proactive chaos engineering experiments. Previously, she worked at Uber as an engineer on the SRE and Infrastructure teams, where she specifically focused on chaos engineering and cloud computing. Catch her tweeting at @Ana_M_Medina about traveling, diversity in tech, and mental health.

The Power Of Operational Reviews

Last fall, we introduced PagerDuty Analytics, a product that combines machine and human response data to provide operational insights that enable organizations to drive process maturity and improved business outcomes. Today, we’re excited to announce that it’s generally available! As part of our expanded Analytics product offering, we’re rolling out a set of prescriptive operational performance scorecards.

Container Incidents by Tabletop intro to Real time Security Operations

When suspicious or risky behaviors occur on one of your servers or containers, what can you see and how quickly can you see it? The growing use of complex infrastructure coupled with sophisticated malicious actors requires immediate action when an incident does occur. Preparation is key.

Postmortems Part 2: How to Adopt a Learning Culture

Culture is the way we do things together. It’s the secret sauce that results in happy, healthy teams that consistently meet their goals. It’s also the hardest thing to define, cultivate, and change in an organization. True cultural change requires more than creating and communicating policies. It takes collaboration, persistence, and experimentation.

Introducing The PagerDuty Postmortem Guide

Your team had been fighting this major incident for hours, but your investigation was hitting one dead end after another. Finally, you managed to isolate the problem and your graphs started to improve. When all systems went back to normal, everyone let out a collective sigh of relief, shut down the response call, and went back to bed, never to think of this incident again. Or so you thought.

Video AMA: Ana Medina

Ana is currently working as a Chaos Engineer at Gremlin 10, helping companies avoid outages by running proactive chaos engineering experiments. She last worked at Uber where she was an engineer on the SRE and Infrastructure teams specifically focusing on chaos engineering and cloud computing. Catch her tweeting at @Ana_M_Medina 11 mostly about traveling, diversity in tech, and mental health.