Operations | Monitoring | ITSM | DevOps | Cloud

%term

How a Production Outage Was Caused Using Kubernetes Pod Priorities

On Friday, July 19, Grafana Cloud experienced a ~30min outage in our Hosted Prometheus service. To our customers who were affected by the incident, I apologize. It’s our job to provide you with the monitoring tools you need, and when they are not available we make your life harder. We take this outage very seriously. This blog post explains what happened, how we responded to it, and what we’re doing to ensure it doesn’t happen again.

Kusto 101 - A Jumpstart Guide to KQL

This blog post is for anyone needing a jumpstart into the world of Kusto. Perhaps you’ve heard about Kusto and are just curious. Maybe you’re just starting to use Azure Monitor for your application monitoring. You might even be getting skilled up in anticipation of the new Squared Up for Azure release that will have KQL at its heart. Whatever your reason, set aside the next 10 minutes and we'll get you up to speed with KQL. Ready? KQL stands for Kusto Query Language.

Integrating Opsgenie and Amazon Security Hub

Brief demo of how to integrate Opsgenie with Amazon Security Hub. Use Atlassian Opsgenie Amazon Security Hub Integration to forward Amazon Security Hub findings to Atlassian Opsgenie. Once Amazon Security Hub sends findings to Opsgenie, Atlassian Opsgenie will determine the right people to notify based on on-call schedules and notify them via email, text messages (SMS), phone calls, and iOS & Android push notifications.