Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Sponsored Post

Replay Real Customer API Sessions as Datadog Synthetics Tests

A customer pings support: "I tried to check out twice this morning and got a 500 each time, but it works fine for everyone else." The session ID is in the email. You have full request/response capture in your environment, you have Datadog Synthetics already running browser checks against the same flow, and you still spend the next two hours grepping logs because none of those tools let you say "show me just this user's requests, in order, and re-run them."

Beyond code execution: the strategic case for stateful AI sandboxes

While ephemeral sandboxes are effective for isolated code execution, enterprise AI agents require a more robust context to be reliable. Upsun provides production-like preview environments, complete with byte-level clones of apps and services, offering a higher standard of validation for agentic workflows.

Solving the Complexity of Data Center Operations with Cloud-Based DCIM Software

Managing a growing data center requires accurate, real-time infrastructure data. Outdated tools often miss critical changes, delay decisions, and make it harder to control energy usage, capacity, and risk. Hyperview is a cloud-based Data Center Infrastructure Management (DCIM) platform that helps teams monitor, manage, and optimize their data center infrastructure from one centralized system.

Running Your App in Production

Your app is deployed. Users are signing in. Traffic is flowing. Everything is live. Congratulations, give yourself a pat on the back. Okay that's enough. Now it’s time to get back to work because you’ve officially entered the phase where production starts revealing all the decisions you made three months ago, unsure how it would affect you today. Because deploying an app is one half of the job. And, production environments have a way of exposing: This is where operations begin.

Dashboard Playlists: Cycle Through Dashboards in TV Mode

When we shipped TV mode, we heard almost immediately: “Great, but I have five dashboards and one screen.” A single dashboard on a wall display covers one view of your infrastructure. If you want to rotate between your network overview, database health, application metrics, and infrastructure summary, someone has to walk over and click, or you’re buying more screens. Dashboard playlists solve this.

The Enterprise Buyer's Guide to Service Desk Automation Platforms

Here’s a story that plays out constantly in enterprise IT, and few people talk about afterward. A team runs an evaluation with multiple vendors using a structured scoring process. Then, they make their choice, but six months into deployment, the platform that excelled in every demo is now struggling with the actual environment. The IT leader who signed off is in a room with their CIO, trying to explain why the numbers fail to match the projections.

SLI, SLO, SLA: What They Mean for Load Testing

Most engineers can recite these three terms. Fewer know how they actually connect during a load test. If your team is running performance tests without mapping results to SLOs, you're collecting data without a pass/fail signal. This short gives you the mental model to turn load test output into something your SLA can actually depend on.