Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

Alert fatigue, part 3: automating triage & remediation with check hooks & handlers

In many cases — as you’re monitoring a particular state of a system — you probably know some steps to triage or in some cases automatically fix the situation. Let’s take a look at how we can automate this using check hooks and handlers.

Alert fatigue, part 2: alert reduction with Sensu filters & token substitution

In my previous post, I talked about the real costs of alert fatigue — the toll it can take on your engineers as well as your business — and some suggestions for rethinking alerting. In part 2 of this series, I’ll share some best practices for fine-tuning Sensu to help reduce alert fatigue.

Alert fatigue, part 1: avoidance and course correction

Alert fatigue occurs when one is exposed to a large number of frequent alarms (alerts) and consequently becomes desensitized to them. This problem is not specific to technology fields: most jobs that require on-call, such as doctors, experience it in slightly different manners, but the problem is the same.

Building + testing open source monitoring tools

At Monitorama 2018, I shared some of the cool process and knowledge I’ve learned from developing a product for people other than myself to consume. After spending six years on call, I now build software that wakes people up in the night — AKA, infrastructure and tooling for systems monitoring and performance analysis. As someone who’s been there, I’m conscientious about building quality software that people delight in using.

Using NGINX for targeted access to the Sensu Core 1.4 API

NGINX can be used as a proxy to provide authenticated access to specific endpoints for any RESTful service API — including the Sensu API. Below I provide an NGINX configuration to grant external service provides narrow access to only create check results in the Sensu 1.4 API external service providers. But first, here's some backstory of how I got here.

Migrating to Sensu, as told by a Nagios refugee

One of our favorite stories at Sensu is hearing how our customers are using, repurposing, and even replacing their Nagios setup. The Sensu monitoring event pipeline lets you run your existing Nagios plugins while also preparing you for what’s next; while Nagios has been tried and true for many, Sensu empowers businesses to modernize their infrastructure with a comprehensive, future-proof monitoring solution.

Migrating to 2.0: the good, the bad & the ugly

The Sensu 2.0 release is dropping in October, and we’re already excited! As the version numbering implies, Sensu 2.0 includes significant changes from Sensu 1.x. Not only has the software been entirely re-engineered in Go — easing deployment significantly — the exposed APIs and internal data structures have changed to accommodate new features.