Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

What Is Replatforming? Everything You Need To Know

Depending on your business reason for migrating to the cloud, you can either move in one go or incrementally. If, however, you prefer to retain some of your operations, app design, or workflows on-premise, you can still do so based on the cloud migration strategies you use. Sometimes, it makes more sense to modify your existing system rather than make too many changes all at once. Cost, data security, and service availability (thus revenue and customer experiences being impacted) are just three concerns.
Sponsored Post

Top Five Pitfalls of On-Call Scheduling

On-call schedules ensure that there's someone available day and night to fix or escalate any issues that arise. Using an on-call schedule helps keep things running smoothly. These on-call workers can be anyone from nurses and doctors required to respond to emergencies to IT and software engineering staff who need to fix service outages or significant bugs. Being on-call can be challenging and stressful. But with the proper practices in place, on-call schedules can fit well into an employee's work-life balance while still meeting the organization's needs.

Detect user pain points with Datadog Frustration Signals

Whether you run an ecommerce site, a digital publication, or any other customer-facing service, delivering optimum user experiences is key to the success of your business. Customers can grow frustrated and abandon your site when they run into hurdles such as JavaScript errors or confusing page designs, and that frustration negatively impacts your company’s bottom line.

Analyze wait events and in-flight queries with the Datadog Database List

When you’re operating databases at scale, being able to get real-time insights across all your databases is essential for addressing issues and identifying areas for optimization. Datadog Database Monitoring’s Database List allows you to monitor your entire database fleet in one place, so you can quickly identify and troubleshoot overloaded hosts and gauge the impact of problematic queries throughout your infrastructure.

Why More Incidents Are Better

Ask most SREs how many incidents they’d have to respond to in a perfect world, and their answer would probably be “zero.” After all, making software and infrastructure so reliable that incidents never occur is the dream that SREs are theoretically chasing. Reducing actual incidents by as much as possible is a noble goal. However, it’s important to recognize that incidents aren’t an SRE’s number one enemy.

Resolve Actions PRO - Introduction to the Event component (to pause and continue automations)

This video describes the capability of the Event component in Resolve Actions PRO. You may need to pause your automation mid-flow whilst it waits for an external event to occur elsewhere in your IT or network environment.