Operations | Monitoring | ITSM | DevOps | Cloud

Monitoring

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Cassandra Monitoring: 6 Best Practices to Pay Attention To

Apache Cassandra is an open-source, distributed database management system specifically built for organizations needing to handle large volumes of data, including when said data is spread across many commodity servers. Cassandra development began at Facebook but later became an open-source Apache project. Now, it’s widely used by some of the biggest enterprises, like Uber, Spotify, eBay, and smaller developer teams.

Feature Spotlight: Timeline

Lumigo’s Transaction Timeline lets you see in a glance the flow of a transaction across its components and the latency caused by each, allowing you to easily identify bottlenecks and issues. Distributed tracing is a popular method for monitoring and profiling transactions in a microservices architecture. It’s what developers use to pinpoint failures, performance drops and other problems.

How to implement Prometheus long-term storage using Elasticsearch

Prometheus plays a significant role in the observability area. An increasing number of applications use Prometheus exporters to expose performance and monitoring data, which is later scraped by a Prometheus server. However, when it comes to storage, Prometheus faces some limitations in its scalability and durability since its local storage is limited by single nodes.

Five Ways AIOps Can Improve IT Incident Management

Artificial intelligence for IT operations (AIOps) is an emerging technology that can help IT operations teams make sense of operational data. As hybrid infrastructure and cloud-native technologies present new levels of complexity, AIOps is showing great promise in simplifying and transforming digital operations management. In our recent Tech Talk, Five Ways AIOps Can Transform Your Enterprise, OpsRamp’s Eric Cook spoke about the need for AIOps in today’s multi-cloud environments.

Lockdown Bugfixes & Midnight Coding

It's been a strange few months here in Edinburgh. Thankfully Downtime Monkey has been largely unaffected by the lockdown, quietly continuing to monitor websites while the world shuts down. Coding from home has been challenging with kids off school and nurseries closed. However, in the twilight zone silent hours after everyone has gone to bed we've been developing improvements and fixing bugs. Here are the details...

Monitoring Applications that use Okta for User Authentication

As the leading provider of identity and access management and authentication for enterprises, Okta gives employees, partners, suppliers, and customers secure access to the tools they need to do their most important work. With deep integrations to over 6,000 applications, the Okta Identity Cloud enables simple and secure access from any device.

How to scale your monitoring with Sensu clustering

I recently led a webinar for the Sensu community on how to scale your monitoring by setting up a three-node cluster in Sensu Go using Sensu’s embedded etcd. Clustering improves Sensu’s availability, allows for node failure, and distributes network load. In this post, I’ll recap the webinar and provide demos on how to set up, back up, and restore your etcd cluster, including best practices for success.

Find What You Need! All About Our New Search Tool.

Product Releases Update Our new Search tool allows you to quickly build detailed queries without having to leave the page. Our new search tool expands on our column filtering feature. You can now create even more detailed queries by filtering on 20 unique criteria. New for this release is the ability to search Call Stack Function Names, Call Stack File Names, and Call Stack Line Numbers.