
The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Managing Python Processes with PM2

PM2 is a production-grade process manager that makes managing background processes easy. In the Python world we could compare PM2 to Supervisord, but PM2 has some nifty features you might like. With PM2, rolling restarts, monitoring, checking logs, and even deploying applications have never been this simple. We really value CLI UX, so PM2 is simple to use and master.

Alert fatigue, part 2: alert reduction with Sensu filters & token substitution

In my previous post, I talked about the real costs of alert fatigue — the toll it can take on your engineers as well as your business — and some suggestions for rethinking alerting. In part 2 of this series, I’ll share some best practices for fine-tuning Sensu to help reduce alert fatigue.

Cloud OnAir: CE TV: Application Observability with LightStep

Observability remains a key challenge as customers embrace DevOps. Join Daniel "Spoons" Spoonhower, the CTO and Founder of Lightstep, a Google Cloud customer, and Yuri Grinshteyn, a Google Cloud Customer Engineer, to learn how Lightstep was built on Google Cloud to enable you to monitor what matters most and diagnose anomalies within seconds across web, mobile, monoliths and microservices.

Will Layer 3 Switches Give Routers the Boot?

Switches are the most common network device deployed on MSP-managed networks, while routers are the least popular—and not by a small margin. The data in Auvik’s recently published report, Managing Network Vendor Diversity: The MSP Challenge, shows switches represent almost half (48%) of all network devices on MSP-managed sites, while routers account for only 6% of the total. Does this mean the death of the router is imminent? In short, no—and here’s why.

Super Monitoring Decorated with 2 Distinctions for Application Performance Monitoring Software

Super Monitoring was recently lauded by a popular software review platform for its steadfast assistance in keeping everyone’s business operations smooth and seamless at all times. For its efficiency in informing users regarding emerging issues and anomalous threats, Super Monitoring was distinguished by the FinancesOnline SaaS review platform with two prestigious awards for 2018: Great User Experience and Rising Star.

Alert fatigue, part 1: avoidance and course correction

Alert fatigue occurs when one is exposed to a large number of frequent alarms (alerts) and consequently becomes desensitized to them. This problem is not specific to technology fields: most professions that require on-call duty, such as medicine, experience it in slightly different ways, but the underlying problem is the same.

Kubernetes monitoring with Prometheus - Prometheus operator tutorial (part 3)

We covered how to install a complete ‘Kubernetes monitoring with Prometheus’ stack in the previous chapters of this guide. But using the Prometheus Operator framework and its Custom Resource Definitions has significant advantages over manually adding metric targets and service providers, which can become cumbersome for large deployments and doesn’t fully utilize Kubernetes’ orchestrator capabilities.