
The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

How does SIGNL4 provide for truly reliable alerting?

Of course, one expects an alerting solution to be reliable, because a missed alert can have a significant impact on the business. Alerts concern IT uptime, disruptions in production, or other critical system conditions. Business processes, production workflows, and therefore money, the company's reputation, or even the health of employees are at stake. But what does reliable alerting actually mean, and how is it achieved?

Blue Matador + Squadcast: Alert Routing Simplified

Blue Matador is the fastest, easiest way to set up AWS infrastructure monitoring, allowing small teams to fully monitor their cloud operations with no manual setup. If you use Blue Matador for your cloud monitoring requirements, you can integrate it with Squadcast, an end-to-end Incident Response tool, to route alerts from Blue Matador to the right users in Squadcast with ease.

4 Ways to Reduce Your Mean Time to Resolution

Dealing with a high MTTR in your network? Auvik Network Management is a comprehensive network monitoring and troubleshooting solution. With over 50 pre-configured alerts, it keeps you informed about critical network events. Users can customize these alerts and control notification frequency so that they have the essential context to fix issues.

NOC Success Like Never Before: Automation Strategies for All-new Incident Management

Network Operations might never be the same. But then again, why would anyone want it to be? The power of automation and orchestration can bring incredible value to the Network Operations Center (NOC), including the business-critical call to get proactive and ahead of the incident response and management game. It’s more than a towering volume of events – it’s the complexities involved, too.

Global AWS Orchestration with Runbook Automation

It is common for companies to have multiple AWS accounts, and as it turns out, there are cases where certain operational tasks need to be performed on the EC2 instances that reside in each account. Examples include standardizing practices for auditing, patching, and incident response, such as retrieving diagnostics or performing remediation. This demo showcases how Runbook Automation orchestrates commands and scripts on EC2 instances spanning numerous AWS accounts through an integration with AWS Systems Manager (SSM).