Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

How to automate sending SquaredUp dashboards to Slack with the Notification API

SquaredUp's existing notifications fire when monitors change state. With Notification API, you control the trigger. Send dashboards on a schedule, before meetings, or on-demand through chat commands. In this step-by-step guide, you’ll learn how to automate sending SquaredUp dashboards to Slack. I’ll use Power Automate as the example, but the same approach works with other automation tools such as Zapier, Make, n8n, or even a custom script, as long as it can send an HTTP request.

LLM Observability Explained: Prevent Hallucinations, Manage Drift, Control Costs

Large Language Models (LLMs) are transforming how businesses interact with users, automate workflows, and deliver insights in real time. But as powerful as these models are, running them at scale comes with unique challenges, from hallucinations and latency spikes to cost overruns and user trust issues.

How to Perform Ping Tests: Different Tools and Techniques

If you’re a remote worker struggling with video calls, or a gamer noticing lag, a quick Internet ping test using an online ping tester can give you a simple yes/no answer: Is my connection alive, and how fast does it respond?. But if you’re a network admin or IT professional, that’s just scratching the surface. Business networks are more complex beasts.

Why DEX Scores Must Be Part of Every Total Cost of Ownership Study

Price is not the same as cost. When organizations evaluate new end-user technology investments, whether that’s laptops, operating systems, or management tools the conversation inevitably turns to Total Cost of Ownership (TCO). TCO studies traditionally focus on direct, measurable costs: hardware procurement, software licensing, support contracts, and lifecycle services. But there’s a growing blind spot in these calculations: the employee experience.

Scaling Datadog observability: 1,000 integrations and counting

Integrations have always been central to the Datadog platform, enabling customers to collect the data they need directly from the technologies they use every day. By unifying signals from infrastructure and applications to security and SaaS applications, teams gain both high-level visibility and the ability to drill into the details that matter the most. With more than 1,000 integrations now available, the Datadog ecosystem continues to expand alongside the platforms our customers rely on.

Keep stakeholders informed with Datadog Status Pages

When incidents occur, clear communication can be just as important as fast remediation. Your internal teams need timely updates to stay aligned, and your users want to know what is happening and when they can expect a fix. Without a reliable way to proactively share updates, support teams can get flooded with tickets and customer trust can erode. Datadog Status Pages, now generally available, makes it easy to keep everyone informed through a public or internal web page during outages.

SQL performance improvements: analysing & fixing the slow queries (part 2)

This is part 2 of a 3-part series on SQL performance improvements. A few weeks ago, we massively improved the performance of the dashboard & website by optimizing some of our SQL queries. In this post, we'll dive deeper into the optimisations of queries with indexes.

Datadog vs Splunk: A Side-by-Side Comparison [2025]

Datadog and Splunk are both leading tools for monitoring and observability. Each offers a range of features designed to help you understand and manage your data. Datadog provides tools for tracking application performance and analyzing logs in real-time. Splunk, meanwhile, is known for its powerful log analysis and search capabilities. In this post, we will compare Datadog and Splunk on important aspects like APM, log management, search capabilities, and more.

Zoom Troubleshooting Performance and Connection Issues: The Complete Guide

In an era of remote work and virtual meetings, Zoom has emerged as a lifeline, connecting people across distances and facilitating seamless collaboration. However, like any technological tool, it's not without its fair share of challenges. From occasional performance hiccups to frustrating connection issues, navigating the world of Zoom can sometimes be a daunting task.

How DreamHost Slashed Memory Usage by 80% and Scaled to 76 Million Time Series

For any growing business, there comes a point where the tools that once worked perfectly begin to show their limits. This is especially true for monitoring infrastructure. As your user base, services, and data volumes expand, the pressure on your monitoring stack intensifies. For web hosting leader DreamHost, with over 1.5 million websites to manage, their existing open-source solutions simply couldn’t keep up.