Operations | Monitoring | ITSM | DevOps | Cloud

Latest Blogs

Mattermost Recipe: Handling Incidents with Mattermost and PagerDuty

Here’s the next installment in our Mattermost Recipes series. The goal of these posts is to provide you with solutions to specific problems, as well as a discussion about the details of the solution and some tips about how to customize it to suit your needs perfectly.

Restricting CFEngine to one CPU core using Systemd

In some performance critical situations, it makes sense to limit management software to a single CPU (core). We can do this using systemd and cgroups. CFEngine already provides systemd units on relevant platforms, we just need to tweak them. I’m using CFEngine Enterprise 3.12 on CentOS 7, but the steps should be very similar on other platforms/versions.

Server monitoring best practices: 9 dos and don'ts

Have you ever had responsibility for an application and been the last to know about an outage? I have, and it’s terrible. You go to check your phone in the morning over coffee, after waking up, and you see a flood of missed calls and tons of emails. Customers are angry. Your boss is demanding to know what’s happening. Even the company’s executives are involved. How did this happen?

Tackling the top four challenges of Azure SQL Database monitoring

With large enterprises increasing their focus on public cloud providers, Microsoft Azure continues to have a strong foothold in the hybrid cloud industry. Azure adoption increased a whopping 11 percent last year from 34 to 45 percent, reveals the latest survey by RightScale.

The Serverless Revolution: Why and How The Movement Will Allow Teams to Deploy With More Velocity and Confidence

Serverless or Function-as-a-Service (FaaS) design patterns have been picking up steam. With the recent release of KNative from Google Cloud, let’s take a closer look at the serverless movement.

The State of Operations Health in the World of DevOps

At PagerDuty, we believe the best way to truly understand the health of your employees is to leverage the real-time human data that is already flowing through your systems. PagerDuty’s platform for action and real-time IT Operations orchestration consists of multiple facets and interlocking capabilities.