The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Cyber Resilience: The Key to Security in an Unpredictable World

Join Ed Bailey and Jackie McGuire as they delve into the topic of cyber resilience and its growing significance in today's digital landscape. In this informative video, you will learn what cyber resilience means, why it's important, and how to manage and improve it in an increasingly unpredictable world. With cyber threats becoming more sophisticated and frequent, cyber resilience has become a critical aspect of protecting personal and business assets. This discussion is perfect for anyone looking to better understand the importance of cyber resilience and how to safeguard against potential threats.

Extending Netdata's anomaly detection training window

We have been busy under the hood of the Netdata agent, introducing new capabilities that let you extend the "training window" used by Netdata's native anomaly detection. This blog post discusses one of these improvements, which helps reduce "false positives" by effectively extending the training window through the new (beautifully named) "number of models per dimension" configuration parameter.

Cloud Providers Health Report - January 2023

Check our January 2023 health report on the most popular cloud providers. We analyze the health of each provider based on the number of outages and problems during the month. The source data is published by the cloud providers themselves on their status pages. We normalize it and use it to generate the report.

6 Real-World Status Page Examples: And What You Can Learn From Them

A status page is the most effective way to stay in touch with your users and quickly inform them about any outages or ongoing maintenance. As explained in our previous article, status pages can offer many benefits such as cost savings and a reduced number of support tickets. Creating a status page can significantly improve your incident management and relationships with your customers.

Supporting Key Business Applications in the Cloud is Challenging: A Real-World Case Study

These days, many IT executives believe that it is easier to deploy applications in the cloud than on-prem. They are also often under the misconception that once an application is hosted in the cloud, it is the responsibility of the cloud service provider to maintain the availability and performance of the application.

The role of APM and distributed tracing in observability

Application performance management (APM) and distributed tracing are practices that many teams have used for years to detect and mitigate performance issues within applications. While the former was born in the era of big single-host monoliths, the latter is especially useful for distributed applications built on a microservices architecture, where tracing is critical for pinpointing the source of performance issues.

How to set up Golang application performance monitoring with open source monitoring tool - SigNoz

In this article, learn how to set up application monitoring for Golang apps using an open-source solution, SigNoz. Scalability, Reliability, Maintainability... The list of benefits of microservices architecture goes on. But along with these benefits come the challenges of complexity.