Here at Grafana Labs we are lucky to work with many partners around the globe. From these partnerships, we get great inspiration into some clever use cases on how Grafana and Prometheus can be used to great effect for service monitoring and availability. We came across this use case that our partner OpenAdvice came up with for their client base, and we thought it was too good to keep secret!
There’s a new Azure agent in town – Azure Monitor Agent (AMA). I hear you saying, “Wait, aren’t there already enough Azure agents? So why another new agent?” A very valid question! Indeed, there are already a few Azure monitoring agents: the Log Analytics Agent, the Diagnostic Extension agent, and the Telegraf agent (and also the Dependency agent if you want to count it separately).
A recurring theme of modern monitoring tools is that they focus on the user, not the systems. To put it another way, while monitoring your infrastructure is essential, it matters more when your customers can’t interact with your application. That would mean lost sales, lost time, unhappy people – and unfortunately for us, unhappy people like to share how grumpy they are. It’s bad for business! But that’s why we’ve made Availability Monitoring in SquaredUp.
In an organization, purchases in IT happen in huge numbers. It is one of the key responsibilities of the IT team to keep track and manage these purchases to optimize spending. But manually tracking these purchases and updating stock using spreadsheets is tedious. It causes blocks in the purchase order process and can lead to discrepancies and human errors. The problem is the lack of an efficient tool to manage purchases.
CERN, the European Organization for Nuclear Research, studies the fundamental structure of the universe by examining the behavior of subatomic particles using highly complex scientific instruments, including the world’s largest and most powerful particle accelerator. In 2015, the research institute deployed Mattermost across the organization to consolidate messaging platforms while providing its community of researchers, scientists, and technologists with a self-hosted, secure messaging space.
Catchpoint detected an AWS outage earlier today, 25th November 2020. Enterprises that had their applications or services running on AWS US East 1 were impacted. Even some of the AWS products were impacted by the outage. We noticed Amazon services such as Athena slowing down at 5AM PST followed by intermittent HTTP 500 errors that started around 5:15AM PST. However, by 5:30AM PST, the fallout from the outage was evident.