Operations | Monitoring | ITSM | DevOps | Cloud

Alerting

What Is IT Mapping and How Can it Prevent the Next Production Incident?

IT infrastructure mapping is the process of creating a visual topology of a network infrastructure. This mapping process helps understand the geographic and interactive layout of a network, which applications depend on. Using infrastructure mapping for troubleshooting, you can quickly understand the relationship between application issues and hardware issues.

Better Uptime Powered Status Page

Better Uptime is a robust uptime monitoring and tracking tool that helps businesses ensure the availability and reliability of their websites and online services. It continuously monitors the performance and availability of a website or application and provides real-time alerts and notifications in case of disruptions or outages. They also provide a relatively simple status page, albeit gorgeous looking and useful for some simple use cases; your team will, in all likelihood, outgrow it at some point.

How to use Kubernetes events for effective alerting and monitoring

Kubernetes, a graduated project of the Cloud Native Computing Foundation (CNCF) ecosystem, is the most prominent and widely used container orchestration systems. It’s used to manage and deploy containers in a wide range of environments, from IoT devices based on Raspberry Pis to enterprise environments consisting of millions of services.

Thousands of Insights at a Glance With Coralogix Alert Map

An effective alerting strategy is the difference between reacting to an outage and stopping it before it starts. That’s why at Coralogix, we’re constantly releasing new features that redefine how alerts are consumed, to enable teams to push their ambitions even further, release with confidence, and tackle issues proactively. Alerts Map is now an indispensable tool for that mission.

The Inevitable - Failures in Distributed Systems

Experiencing failure at scale is as the popular Marvel character Thanos would say “Inevitable”. Memory leaks, software or hardware or network I/O failures are just a few. It’s a problem of simple mathematics, the probability of failing rises as the total number of operations performed increases. With each component used to scale the application, the failure quotient increases. So how do you tackle this so-called “Inevitable” problem that comes with scaling?

How Healthchecks Sends Signal Notifications

When a cron job does not run on time, Healthchecks can notify you using various methods. One of the supported methods is Signal messages. Signal is an end-to-end encrypted messenger app run by a non-profit Signal Foundation. Signal’s mobile client, desktop client, and server are free and open-source software (with some exceptions–read on!).