
Alerting

PagerDuty + Atlassian: Taking Modern Incident Response in Stride

To meet rising customer demands and the expectation of “real time, all the time,” digital operations is changing the way people work. One of the most interesting macro trends is how this affects not just IT Operations and Development teams but the entire business, which is becoming involved in raising the level of responsiveness to customers.

Building Automated Monitoring with Icinga and OpsGenie

How many servers can be managed by one system administrator? This question is hard to answer, since it depends heavily on the tasks that need to be performed. It is clear, however, that the number of servers one engineer can manage has increased tremendously over time and is still growing. Public and private clouds, in combination with automation tools, enable us to automate many daily tasks. In a modern IT infrastructure almost everything can, and should, be automated.

5 Ways to Suppress Alert Noise

At OpsGenie, we pride ourselves on being the most reliable and flexible alert and incident management solution. However, what happens when you simply don’t want notifications? Even with escalations, routing rules, and on-call schedules, you may want finer control over when you are notified and for which types of alerts.

7 Tips to Get New Engineers Ready to Be On-Call

Before the DevOps philosophy took hold, developers would build products, services, and infrastructure, but the responsibility for maintaining them would shift to operators, aka system or IT admins. The DevOps philosophy removes the boundary between Operations and Development teams, making system reliability a shared responsibility of all parties.