The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

The hidden costs of shadow AI: CPU drain, data risk, and network bottlenecks

The risk of headline-grabbing incidents caused by AI usage outside the authorization and control of IT (a.k.a. shadow AI), such as Samsung’s ChatGPT data leak, is clear. Most IT teams recognize that a high-profile incident can have serious repercussions. However, the risk of shadow AI goes well beyond any single incident. In fact, the recent Komprise IT Survey indicates that 79% of organizations have experienced negative outcomes from sending corporate data to AI.

Visualize Logs Alongside Metrics: Complete Observability for Slow MongoDB Operations

MongoDB’s strengths, a flexible schema and fast iteration, can also hide costly queries until they surface as user-facing latency, replica lag, or spiky CPU. A handful of slow operations can put pressure on the cache, starve other workloads, and cascade into timeouts across services. Monitoring slow queries gives you an early warning system for index gaps and query-plan regressions introduced by code deploys, schema changes, or shifting data shapes.
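
As a rough illustration of the "early warning for index gaps" idea, here is a minimal sketch that scans log lines for slow queries that fell back to a full collection scan. It assumes the structured JSON log format used by MongoDB 4.4 and later, where slow operations are logged with the message "Slow query" and carry `ns`, `planSummary`, and `durationMillis` under `attr`. The function name `slow_collection_scans`, the 100 ms threshold, and the sample lines are hypothetical and are not taken from the article.

```python
import json


def slow_collection_scans(log_lines, threshold_ms=100):
    """Return (namespace, duration_ms) pairs for slow queries that used COLLSCAN.

    Hypothetical helper: assumes MongoDB's structured JSON log lines.
    """
    hits = []
    for line in log_lines:
        try:
            entry = json.loads(line)
        except json.JSONDecodeError:
            continue  # ignore lines that are not JSON
        if not isinstance(entry, dict) or entry.get("msg") != "Slow query":
            continue
        attr = entry.get("attr", {})
        duration = attr.get("durationMillis", 0)
        plan = attr.get("planSummary", "")
        # COLLSCAN means no index was used, a common sign of an index gap
        if duration >= threshold_ms and plan.startswith("COLLSCAN"):
            hits.append((attr.get("ns", "?"), duration))
    # Slowest operations first
    return sorted(hits, key=lambda hit: hit[1], reverse=True)


# Made-up sample lines, trimmed to the fields the sketch reads
sample = [
    '{"msg": "Slow query", "attr": {"ns": "shop.orders", "planSummary": "COLLSCAN", "durationMillis": 840}}',
    '{"msg": "Slow query", "attr": {"ns": "shop.users", "planSummary": "IXSCAN { email: 1 }", "durationMillis": 150}}',
    '{"msg": "Connection accepted", "attr": {}}',
    "not json",
]
print(slow_collection_scans(sample))
```

In practice the resulting counts or durations would be forwarded to whatever metrics system you already use, so that a rise in unindexed slow queries after a deploy shows up next to latency and CPU graphs.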

AWS metric ingestion for less: Save money and get near real-time stream into Grafana Cloud

There’s a new way to ingest AWS metrics into Grafana Cloud that makes observing your AWS resources more cost-effective, easier to operate, and more accurate. You can now stream metrics into the AWS Observability app in Grafana Cloud in near real-time thanks to our new integration with Amazon CloudWatch and Amazon Data Firehose. We’re already using it internally, and we’re finding that it’s not only easier to operate—it’s at least five times more cost-effective.

This Month in Datadog - August 2025

In the August episode of This Month in Datadog, Jeremy shares how you can make more informed cloud cost decisions, gain insights into your LiteLLM-powered applications, and secure Kubernetes infrastructure with Datadog Workload Protection. Later in the episode, Danny puts the spotlight on Datadog Kubernetes Autoscaling, which helps you deliver cost savings without sacrificing performance.

Kentik Traffic Costs Workflow Demo

Learn how Kentik's automated traffic cost workflow provides instant visibility into network traffic costs, enabling you to optimize spend, improve margins, and make smarter business decisions. In this demo, you'll see practical examples like evaluating costs by AS group and downstream customer, helping network, finance, and commercial teams take immediate, actionable steps to reduce costs and boost efficiency.

Logs are Generally Available (Still logs, just finally useful)

When we started building Logs in Sentry, we had one goal: make them useful for real debugging, not just another high-volume text store. That meant making them "trace connected" from day one, so they are tightly linked to the actions and performance happening in your application, right where developers already go to investigate errors, performance, and latency issues. Now, Logs is out of beta and generally available to everyone.

Bridging the Gap: Legacy Systems and Modern Observability

Technology moves quickly, and while the spotlight has shifted to dynamic, cloud-based systems, many organizations still have legacy applications and infrastructure that they must maintain. In this fireside chat, Datadog’s Matt Moore (Principal Observability Strategist) will host James Flores (Enterprise Systems Engineer) at Australian Community Media to discuss their journey of modernization and bridging legacy systems with the cloud using a bit of ingenuity and observability.

Cost Controls and so Much More: Issue Detection Through Usage Analysis

Keeping tabs on cloud spending across multiple organizations and vendors, including Datadog, can be tough and costly. If you're not tracking expenses, you're also missing other critical insights. The Flight Centre Travel Group (FCTG) faced this when moving to Datadog, needing to monitor costs across numerous organizations and over 180 Azure subscriptions. After a rapid migration, new cost reports quickly revealed more than just financial benefits. Unusual spending patterns often highlighted incidents, bugs, or security issues, offering early warnings about internal system problems.