Incident Management

The latest news and information on incident management, on-call, incident response, and related technologies.

The Risks Of Using Small Status Page Vendors

Servers are down. Employees are scrambling. Customers are upset. The pressure is on. When internal operations are in disarray and your business is experiencing a service outage, the last thing you need to worry about is the reliability of your incident communication solution. Keeping users informed when services are down is mission-critical: it prevents a flood of support requests that would compound the effects of the incident and strain employee productivity and bandwidth.

PagerDuty and Fiberplane Integration Demo

Presenter: Aparna Valsala, Solutions Engineer at Fiberplane. Using the PagerDuty and Fiberplane integration, the responding engineer can immediately start the investigation from a predefined, configurable Fiberplane template that is visible to all, while multiple engineers collaborate on the investigation with complete visibility and context.

Causes of Data Center Outages and How to Overcome Them

With the increasing computing requirements and complexity of data center systems, unplanned downtime has become a severe threat to enterprises in terms of process violations, revenue losses, and reputational damage. Although data center failures are quite common, it can be difficult to predict every scenario that might severely affect your company's growth, especially when some factors, such as a natural disaster, are simply beyond your control and can result in data center outages.

APIs' Impact on DevOps: Exploring APIs' Continuous Evolution

An application programming interface (API) is a set of rules and protocols that enables different software applications to communicate and share data and functionality. The concept of an API has been around for a long time. However, APIs as we know them today emerged in the late 1990s and early 2000s with the rise of the internet and web-based services. As more businesses began to offer online services, the need for a standardized way for these services to interact and share data became apparent.

How to talk to your executive leadership team about reliability

Product reliability requires investment from all areas of the business. Technology leaders must effectively communicate the implications of service reliability to the rest of the organization. As a leader, how do you prove that a more reliable product is critical to success? In this webinar, experts from BetterCloud, Machinify, and Blameless discuss how to talk to your executive leadership team about reliability.