Operations | Monitoring | ITSM | DevOps | Cloud

Set Up for Success: Service Taxonomies in PagerDuty

It’s 2:37 a.m. on a Tuesday night, you’re asleep—but it’s also your turn to be on call. You receive a phone call from PagerDuty. Your partner hits you with a pillow in an attempt to wake you up. It worked. You groggily answer the call and hear your favorite robo-guy on the other end of the line.

Integrate OpManager with AlarmsOne and manage your alerts like a pro

OpManager helps enterprises monitor their network, devices, servers, firewall, and more. While all this helps keep your systems up and running at all times, effectively managing alerts is another challenge altogether. If you’re using a number of IT management tools, it can be hard to address issues like alert noise, inflexible on-call scheduling, and escalations.

Accelerating Incident Response With Real-Time Business Data at Wayfair

Like any good e-commerce company, Wayfair collects a significant amount of data to use for business intelligence. Until recently, the majority of this data was crunched off-hours in preparation for business use the next day. We also create a great deal of data about our applications and infrastructure in real time.

Volunteers, Not Conscripts: Fixing Out-Of-Hours On-Call at Intercom

Uptime matters. At Intercom, we believe that keeping our product online and working well at all times is critical to the success of our business. Out-of-hours on-call is inherently disruptive to your life as an engineer. You need to be ready to respond quickly and competently to an alert about something being broken.

Monday Update: Customer Survey, Telegram Integration, Atlassian & Slack, Browser Extensions, and SSL

Our final Monday update of July and although many of our customers are heading off on summer vacation not only are we here monitoring your websites 24/7, but we’ve got some exciting new features and improvements happening over the holiday break.

How to Communicate with Customers During an Outage

Customers are the lifeblood of a successful enterprise. Yet too often, enterprises fail to keep their customers up to date during an outage. In these scenarios, enterprises risk alienating customers and losing them to rivals. To better understand why this may be the case, let’s consider an example.