Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Infrastructure Monitoring With Amazon CloudWatch and OnPage Integration

Digitalization of business has transformed the world and its industries. Software that upkeep digital initiatives are no longer categorized as a support function. Rather, they are integral to every business process. Modern organizations require infrastructure monitoring tools to detect anomalies and alerting systems to automate remediation processes.

Splunk On-Call: New Name, New Features to Improve On-Call For Your Teams

Today, more than ever, mobilizing remote teams to triage and resolve outages separates is separating enterprises able to accelerate their digital initiatives from those who don’t. Observability has elevated our ability to quickly detect problems and ask questions in our system to triage and reduce “time to clue” — an increasingly important metric.

How to perform incident management with ServiceNow and Elasticsearch

Welcome back! In the last blog we set up bidirectional communication between ServiceNow and Elasticsearch. We spent most of our time in ServiceNow, but from here on, we will be working in Elasticsearch and Kibana. By the end of this post, you'll have these two powerful applications working together to make incident management a breeze. Or at least a lot easier than you may be used to!

How PagerDuty and Slack Empower the "Work Where You Are" Mindset

Our reliance on digital services continues to be heightened by the ongoing COVID-19 pandemic. For work, school, and play, digital remains the primary channel. This puts huge pressure on ITOps and DevOps teams, making it critical that they can collaborate easily to resolve incidents rapidly. Many modern ITOps and DevOps teams rely on one of PagerDuty’s key integration partners, Slack, to meet this need.

Digital Retail Tips: Reduce Downtime on Black Friday (and Cyber Monday)

Black Friday is one of the biggest days of the year for online consumers and retailers alike. This year, the coronavirus (COVID-19) pandemic is reshaping Black Friday shopping — and digital consumers and retailers must plan accordingly. The coronavirus pandemic will likely cause Black Friday shopping to decline this year. As such, many digital retailers are launching early Black Friday sales, so they can capture consumers’ interest ahead of the competition.

Five worthy reads: Preparing an incident response plan for the pandemic and beyond

Five worthy reads is a regular column on five noteworthy items we’ve discovered while researching trending and timeless topics. With the rising concern over cyberattacks in the distributed workforce, this week we explore the concept of cybersecurity incident response during a pandemic.

Delivering Always-On Digital Experiences in Retail

How is it already near the end of October? We know our retailer customers have been heads-down thinking about code freezes and hypercare during the high season as we approach the holidays. Disruption and pivoting quickly to meet changing customer expectations is nothing new to the retail industry.

"The clearest and most singular footprint for AIOps": BigPanda named leader once again in EMA's Radar Report on AIOps

Leading analysts continue to acknowledge BigPanda’s leading role in the AIOps ecosystem. Earlier this year the GigaOm AIOps Radar report placed BigPanda in the leader section for the company’s strong market impact as a platform that delivers event correlation at scale.

How to connect ServiceNow and Elasticsearch for bidirectional communication

The Elastic Stack (ELK) has been used for observability and security for many years now, so much so that we now offer the two as out-of-the-box solutions. However, identifying issues and finding the root cause is only part of the process. Often, organizations want to integrate the Elastic Stack into their everyday workflows so they can resolve those issues quickly. This typically involves integrating with some form of ticketing/incident tracking framework.