Monitoring

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Best practices for alerting on Kubernetes

Jul 9, 2020 By Jorge Salamero Sanz In Sysdig

A step by step cookbook on best practices for alerting on Kubernetes platform and orchestration, including PromQL alerts examples. If you are new to Kubernetes and monitoring, we recommend that you first read Monitoring Kubernetes in production, in which we cover monitoring fundamentals and open-source tools. Interested in Kubernetes monitoring?

Read Post

Sysdig

Read more about Best practices for alerting on Kubernetes

Logs and Metrics and Traces, Oh My!

Jul 9, 2020 By Splunk In Splunk

There are a lot of aspects to supporting modern applications, and it all starts with the data applications produce that give visibility and insights into what is going on. In the first episode of Dissecting DevOps, Dave and Chris review the differences between logs, metrics, and traces. Find out how these sources of data help you better understand and support your application.

View Video

Splunk

Read more about Logs and Metrics and Traces, Oh My!

[Webinar] Best practices to manage AWS cloud

Jul 9, 2020 By Site24x7 In Site24x7

Site24x7 offers unified cloud monitoring for DevOps and IT operations. Monitor the experience of real users accessing websites and applications from desktop and mobile devices. In-depth monitoring capabilities enable DevOps teams to monitor and troubleshoot applications, servers and network infrastructure including private and public clouds. End user experience monitoring is done from 90+ locations across the world and various wireless carriers.

View Video

Site24x7

Read more about [Webinar] Best practices to manage AWS cloud

How to Create a Python Stack

Jul 9, 2020 By Mukul Khanna In Scout

All programming languages provide efficient data structures that allow you to logically or mathematically organize and model your data. Most of us are familiar with simpler data structures like lists (or arrays) and dictionaries (or associative arrays), but these basic array-based data structures act more as generic solutions to your programming needs and aren’t really optimized for performance on custom implementations. There’s much more than programming languages bring to the table.

Read Post

Scout

Read more about How to Create a Python Stack

SRE Report 2020 - Balancing 'Dev' and 'Ops'

Jul 9, 2020 By Catchpoint Systems In Catchpoint

We recently released Catchpoint’s SRE Report 2020 that analyzed results from the SRE survey we conducted early this year along with a recent addendum survey. The report offers a detailed look at the current state of SRE and how the shift to an all-remote work environment has impacted SRE teams. In this blog, we take a deeper look at one of the report highlights – ‘Heavy Ops Workload Comes at a Cost’.

Read Post

Catchpoint

Read more about SRE Report 2020 - Balancing 'Dev' and 'Ops'

Pandora FMS 747 Release

Jul 9, 2020 By Pandora FMS team In Pandora FMS

These release notes describe new features, improvements and fixed issues in Pandora FMS NG 747. They also provide information about upgrades and describe some workarounds for known issues.

Read Post

Pandora FMS

Read more about Pandora FMS 747 Release

5 Serverless AWS Core Services Everyone Should Have in Their Starter Toolkit

Jul 9, 2020 By Mariliis Retter In Dashbird

When first looking into serverless migration and its architecture, it can feel like you’re staring down an endless shopping aisle of critical serverless tools that all need to be put into your basket straight away. Some services seem to offer the same function, while others can feel wildly different - both, as a result, can instill some doubts as to what is really necessary for your business and serverless application.

Read Post

Dashbird

Read more about 5 Serverless AWS Core Services Everyone Should Have in Their Starter Toolkit

Timestamps On Downtime Alerts

Jul 9, 2020 By Downtime Monkey In Downtime Monkey

We've made a useful improvement to Downtime Monkey alerts. Each downtime alert now includes a timestamp that shows the time that the website went down and each uptime alert includes a timestamp that shows the time that the website came back up. This turned out to be more work than expected, largely because we thought we'd knock it out in under an hour :) Although it wasn't totally straightforward to develop, the end-result is incredibly simple to use...

Read Post

Downtime Monkey

Read more about Timestamps On Downtime Alerts

How to use check aggregates in Sensu Go

Jul 9, 2020 By Aaron Sachs In Sensu

Aggregates, which allow you to monitor groups of checks or entities, were a much-beloved feature in Sensu Core (the predecessor to Sensu Go) — Ben Abrams describes them as “awesome” in his post on alert fatigue, noting that aggregates are like having “a bunch of nodes behind a load balancer where each node is healthchecked, and if a node drops out it may not be worth waking someone up in the middle of the night.”

Read Post

Sensu

Read more about How to use check aggregates in Sensu Go

OpManager now supports SMSEagle, Twilio, and Clickatell, so you can get SMS alerts anywhere!

Jul 8, 2020 By OpManager In ManageEngine

IT admins need to know the status of their IT devices, servers, routers, switches, and firewalls. To meet this need, OpManager has a highly responsive and robust notification and alerting system that sends alerts via email, Slack, and even SMS. Murphy’s law says anything that can go wrong will go wrong, and if you’re in IT, you’re probably familiar with how easily things can go wrong.

Read Post