Operations | Monitoring | ITSM | DevOps | Cloud

Monitoring

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

9 Best Cloud Logging Services for Log Management, Analysis, Monitoring & More [2021 Comparison]

Log management stopped being a very simple operation quite some time ago. Long gone are the “good old days” when you could log into the machine, check the logs, and grep for the interesting parts. Right now things are better. With the observability tools that are now a part of our everyday lives, we can easily troubleshoot without the need to connect to servers at all. With the right tools, we can even predict potential issues and be alerted at the same time an incident happens.

Benchmarking Grafana Enterprise Metrics for horizontally scaling Prometheus up to 500 million active series

Since we launched Grafana Enterprise Metrics (GEM), our self-hosted Prometheus service, last year, we’ve seen customers run it at great scale. We have clusters with more than 100 million metrics, and GEM’s new scalable compactor can handle an estimated 650 million active series. Still, we wanted to run performance tests that would more definitively show GEM’s horizontal scalability and allow us to get more accurate TCO estimates.

Introduction to cron job monitoring with Healthchecks

Software teams use cron jobs to handle many important tasks like database backups and maintenance scripts. Cron jobs make sure that your applications are behaving as they should, but cron job failures are often silent and not noticed until the problem becomes worse. In this guide, we will learn how to stay aware about cron job failures by using Healthchecks.

Five Reasons to Use Catchpoint for Measuring Core Web Vitals

We are in this together. As part of our continuous efforts to meet customer expectations, we have recently added Core Web Vitals to our performance measurement programs. We are happy to share that these metrics are now a native part of the Catchpoint Platform. DevOps’ SREs, Platform Operations Engineers, and business and monitoring strategists alike will realize a series of key benefits from this addition.

Getting Started with the Splunk Distribution of OpenTelemetry Java

Splunk Distro for OpenTelemetry is a secure, production-ready, Splunk-supported distribution of the OpenTelemetry project and provides multiple installable packages that automatically instruments your Java application to capture and report distributed traces to Splunk APM (no code changes required!), making it easy to get started with distributed tracing!

Improve Your CMDB for Business Outcomes with Application Dependency Mapping

A configuration management database (CMBD) is a centralized repository that stores information about all the significant entities in your IT environment. These can include your hardware, installed software applications, documents, business services, and even the people who are part of your IT system. The CMDB is designed to help you maintain and support the interrelationships between the configuration items (CIs) within a vast IT structure.

Uncover How Your Employees Experience Their SaaS Applications in Real-Time

With employees depending on web applications every day, you can’t risk leaving anything to doubt when it comes to managing your IT estate. Although technology performance might appear “in the green” from IT’s perspective, how often are employees experiencing application outages or slowdowns you’re not aware of? Are they using that highly touted new app you rolled out – or avoiding it because of hidden usability problems?