Defining and measuring your SLIs and SLOs

Customers expect online services to be available all the time. The truth is that outages happen to almost everyone, because providing 100% service availability is challenging and costly. Creating a reliable and profitable service means, among other things, finding the balance between application availability, cost, and time to market. Faster feature delivery tends to mean lower availability, because constant changes to production can cause issues and introduce bugs.
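To make the availability trade-off concrete, here is a minimal sketch of how an availability target translates into allowed downtime. The function name, the 30-day window, and the example targets are assumptions chosen for illustration; they do not come from the article.

```python
def allowed_downtime_minutes(slo_percent: float, window_days: int = 30) -> float:
    """Return the downtime budget, in minutes, implied by an availability target.

    slo_percent and window_days are illustrative parameters, not values
    taken from the article.
    """
    if not 0 <= slo_percent <= 100:
        raise ValueError("slo_percent must be between 0 and 100")
    total_minutes = window_days * 24 * 60
    # The budget is the fraction of the window that is allowed to be unavailable.
    return total_minutes * (1 - slo_percent / 100)


if __name__ == "__main__":
    for target in (99.0, 99.9, 99.99):
        budget = allowed_downtime_minutes(target)
        print(f"{target}% over 30 days allows about {budget:.1f} minutes of downtime")
```

Each additional "nine" shrinks the budget by a factor of ten, which is one way to see why pushing toward 100% availability leaves little room for frequent production changes.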

Investigate critical alerts on the go with the Datadog mobile app

The Datadog mobile app provides real-time visibility into critical alerts, incidents, and application performance metrics across your entire environment, helping you troubleshoot directly from your mobile device. On-call engineers can quickly evaluate the conditions that triggered an alert, determine its urgency, and decide the next course of action—anywhere, anytime.

Inside the migration from Consul to memberlist at Grafana Labs

At Grafana Labs we run a lot of distributed databases. These distributed databases all make use of a hash ring in order to evenly distribute workloads across replicas of certain components. For a more detailed description of the architecture of our projects, check out our Mimir architecture docs.
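For readers unfamiliar with hash rings, the sketch below shows the general idea of consistent hashing with several tokens per replica. It is an illustration only, not Grafana Labs' implementation: the class name `HashRing`, the `tokens_per_replica` parameter, the replica names, and the choice of SHA-256 are all assumptions.

```python
import bisect
import hashlib


class HashRing:
    """Toy consistent-hash ring (hypothetical, for illustration only)."""

    def __init__(self, replicas, tokens_per_replica=64):
        if not replicas:
            raise ValueError("at least one replica is required")
        # Each replica owns several tokens so that load spreads more evenly.
        self._ring = sorted(
            (self._hash(f"{replica}-{i}"), replica)
            for replica in replicas
            for i in range(tokens_per_replica)
        )
        self._tokens = [token for token, _ in self._ring]

    @staticmethod
    def _hash(key: str) -> int:
        # Use the first 8 bytes of SHA-256 as a 64-bit position on the ring.
        return int.from_bytes(hashlib.sha256(key.encode()).digest()[:8], "big")

    def lookup(self, key: str) -> str:
        # The owner is the replica holding the first token clockwise from the
        # key's position; the modulo wraps around the end of the ring.
        idx = bisect.bisect_right(self._tokens, self._hash(key)) % len(self._tokens)
        return self._ring[idx][1]


if __name__ == "__main__":
    ring = HashRing(["replica-a", "replica-b", "replica-c"])
    print(ring.lookup("tenant-42/series-1"))
```

Because a key's owner depends only on its hash and the token positions, adding or removing one replica moves only the keys adjacent to that replica's tokens rather than reshuffling everything.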

Create and Manage Maintenance Windows Through PagerDuty Mobile App

To respond in real time to urgent, critical digital incidents, on-call responders must be able to take action from anywhere. But when on-call responders become overwhelmed with alerts, they often just ignore them, because they cannot tell the difference between a real alert and a false one.

Win One New Client Every Month - The ultimate MSP marketing strategy in just 60 mins with Paul Green

Looking to grow your MSP? Want to win more customers? Join us for NinjaOne's exclusive webinar with MSP growth strategy expert Paul Green, who will present a turnkey solution on how you can create an effective marketing infrastructure and start winning new customers in no time! Paul has been specializing in MSP marketing and growth since 2016. He is the founder of Paul Green's MSP Marketing, author of the book "Updating servers doesn't grow your business", and the host of his very own podcast!

Code-level Application Monitoring for Every Developer

The monitoring, tooling, and observability space is crowded. It’s hard to keep track of what most tools in this category originally set out to do, but if we had to guess, they were probably built to support monolithic architectures with complex systems and to give Ops and IT a way to minimize the impact of an outage.

How I monitor cloud application costs in one simple but powerful dashboard

Although there are many great tools out there to get on top of application monitoring, there’s one vital metric that’s often overlooked by us technical folks – cost. In the days of running apps on servers in private datacenters, the kit was a one-time purchase that the systems team had to deal with. But running apps in public clouds is a different story. Whether you’re running on VMs, containers in Kubernetes, or entirely serverless, execution of your code adds to the bill.