
Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

SecOps tools - SecOps & incident management for 2022.

Importance of SecOps tools: Threats in the cyber world grow more complicated and sophisticated with each passing day, while the rapid expansion of digital operations, with more nodes, networks, and servers, has resulted in more vulnerabilities. This situation demands efficient SecOps teams and practices so that threats are thwarted and networks and data are always protected. What is SecOps, and what are the best SecOps tools?

AWS outage? A better way to monitor outages in Amazon Web Services

Amazon Web Services (AWS) needs no introduction. It's one of the most popular services in the world, and in fact the most popular cloud infrastructure provider (34%) according to this study. Like any other service, it has outages. If you run your infrastructure on it, there's a good chance that outages have impacted your business in the past. And the reality for AWS (or any other service) is that there's a good chance it will happen again.

360º Fireside Chat with PagerDuty, Lisbon's Newest Tech Employer

Join João Freitas, GM & Engineering Site Lead in Lisbon, for this 360º Fireside Chat about PagerDuty’s projects and challenges, the technical side of PagerDuty, how everything comes together, where we are today, and where we’re going in the future.

A deeper dive into the Rogers outage

Beginning at 8:44 UTC (4:44am EDT) on July 8, 2022, Canadian telecommunications giant Rogers Communications suffered a catastrophic outage, taking down nearly all services for its 11 million customers in what is arguably the largest internet outage in Canadian history. Internet services began to return after 15 hours of downtime and were still being restored throughout the following day.

Outage Alert: Top 5 Outages of Q2 2022

We are halfway through 2022 and one thing is certain: downtime is here to stay. In fact, trends show that downtime is increasing in frequency, severity, and breadth of impact. Consumers and businesses are more interconnected and more reliant on technology and software than ever, from remote business communication to simply listening to your favorite podcast on your way to work.

Promoted to SRE Advocate: A Dream Turned Reality

I get chills thinking about a line from the first film adaptation of Roald Dahl's Charlie and the Chocolate Factory. Gene Wilder, as Wonka, nearly whispers it to Charlie, as if it were secret information: "We are the music makers, and we are the dreamers of dreams." For me, the quote (taken from a poem by Arthur O'Shaughnessy) is austere: We are the creators of what we create, and what we create becomes what we are.

We've raised $34M to help organisations be resilient in the face of failure

TL;DR: We’ve raised $34M to bring increased resilience to organisations around the world. With this latest round of investment, we’re expanding internationally into the US, accelerating our product plans, and growing our amazing team 🎉. As technology becomes more complicated and runs an ever greater part of our lives, failure becomes more likely and more costly.