Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Sponsored Post

Extreme automation and the SAP Cloud ERP journey

Cloud ERP arrives as the new holy grail of ERP architecture: a composable, flexible and scalable collection of core business services working together to meet enterprise ERP needs. Of course, getting there for a large enterprise with significant existing complexity across legacy SAP implementations isn't a trivial task. Much has been written about S/4HANA migration, but less explored are the benefits of automation solutions used for the regular operations of SAP to migration projects. These solutions offer a number of accelerators and benefits to migration projects and SAP teams, so it is worth exploring.

The ROI of monitoring your Azure environment: Prevent surprises, control costs, boost uptime

Like many cloud providers, Azure offers services that scale with usage. However, unanticipated overutilization of Service Bus, Azure Functions, and SQL databases can incur additional costs. Managing these resources effectively is crucial for keeping the billing framework predictive.

Mission Impossible: Find out the Reasons Why Your Network Is Down (and How to Proactively Prevent Network Downtime)

Your mission, should you choose to accept it, is to prevent network downtime before it takes your business offline. The threat is real. One moment, your network is up. The next calls drop, websites freeze, apps stall, and customers vanish. You hear the dreaded question echoing across departments: “Is the network down?” You’re not alone.

Shedding Light on Kafka's Black Box Problem (with OpenTelemetry)

"All language is but a poor translation." — Franz Kafka This quote by Franz Kafka reminds me of the time when I used to look at metrics from “Apache Kafka” topics trying to figure out what was causing the huge lags and manually deleting the messages in certain partitions to get rid of polluted messages. Yep, pretty lost in translation. I wasn’t aware of the power of observability for a Kafka producer-topic-consumer system.

An Easy and Practical Guide to CDN Monitoring

A CDN delivers your content around the world, making sure users get it quickly and reliably. When it slows down or goes offline, users notice right away. Good CDN monitoring gives your team the information needed to fix issues before they affect users. This guide explains the basics of CDN monitoring and shows practical ways to set it up.

Graylog vs Loki: Key Differences and Use Cases

Logs are a key part of building and running software, but managing them can get complicated fast. As your apps grow and generate logs from many sources, choosing the right tool to store, search, and analyze those logs becomes important. Graylog and Loki are two popular options, each with a different way of handling logs. In this blog, we’ll break down the main differences between Graylog and Loki, how they work, and which types of projects they suit best.

How to Reduce Downtime: Keep Your Business Running Smoothly

Downtime refers to any period when your business operations are interrupted or unavailable due to technical issues. Whether it's caused by unscheduled downtime, like sudden system failures, or planned downtime for regular maintenance, it can significantly impact your business continuity. The effects of downtime can be severe, leading to financial losses, decreased productivity, and a damaged reputation.

Cloud Cost Management & Trends in 2025: Strategies to Optimize Your Cloud Spend

Cloud computing has become the backbone of modern business operations, powering everything from day-to-day collaboration to large-scale digital transformation initiatives. As organizations deepen their reliance on cloud services, the financial stakes continue to grow. According to Gartner, global spending on public cloud services is projected to reach over $720 billion in 2025, a significant increase from nearly $600 billion in 2024.

How to Choose an APM Solution: 5 Critical Questions for 2025

An APM solution, or Application Performance Monitoring tool, is a software application that helps businesses monitor and manage the performance and availability of software applications. APM tools gather data from systems, servers, databases, APIs, and end-user devices to provide deep insights into the root causes of performance issues. APM solutions have evolved far beyond basic monitoring.

Grafana Campfire - Hiring with AI and more about Grafana MCP (Grafana Community Call - May 2025)

In this Campfire community call, we will talk about the new and the future of AI in the field of Observability space and also discuss about the Grafana MCP server to provide access to your Grafana instance and the surrounding ecosystem. Join me (Usman), Matt Ryer, Carl Bergquist, David Kaltschmidt for this exciting session. Special guests: Sarah Zinger, Cyril Tovena and Ben Sully.