Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

How a Production Outage Was Caused Using Kubernetes Pod Priorities

On Friday, July 19, Grafana Cloud experienced a ~30min outage in our Hosted Prometheus service. To our customers who were affected by the incident, I apologize. It’s our job to provide you with the monitoring tools you need, and when they are not available we make your life harder. We take this outage very seriously. This blog post explains what happened, how we responded to it, and what we’re doing to ensure it doesn’t happen again.

Kusto 101 - A Jumpstart Guide to KQL

This blog post is for anyone needing a jumpstart into the world of Kusto. Perhaps you’ve heard about Kusto and are just curious. Maybe you’re just starting to use Azure Monitor for your application monitoring. You might even be getting skilled up in anticipation of the new Squared Up for Azure release that will have KQL at its heart. Whatever your reason, set aside the next 10 minutes and we'll get you up to speed with KQL. Ready? KQL stands for Kusto Query Language.

Using Vagrant to simplify building Virtual Machines

Oracle’s VirtualBox software is a key tool in software and website development, but can be complicated to configure. Vagrant simplifies the process and enables developers to repeatably build and scrap near-identical Virtual Machines (VM). This post will create a Ubuntu 18.04 Virtual Machine with a local directory mounted on it to make it easier to code on.

Network Emulation. Bringing real-world conditions to the test environment

The Network Emulation is a relevant technology when making tests related to the behaviour of our platform. Let’s look at these situations: All these situations are part of the day-to-day work of the IT managers and all are responded to by developing the necessary tests. However, when we propose to do these tests, two options arise: simulation and emulation of networks. These are two concepts that are often used interchangeably but are actually very different.

Best Tools to Help You Get More Engagement

“You cannot buy engagement. You have to build engagement.” Have you already created your customer engagement strategy? And have you already answered the question “How to increase customer engagement?” If your answer is half yes, half no or simply no, then you will probably face problems that will turn into lost sales opportunities, customer dissatisfaction, and low popularity.

Fireside chat with Kelsey Hightower, part one: on Kubernetes, legacy tech, and the future of monitoring

At Sensu Summit 2018, Sensu CEO Caleb Hailey and CTO Sean Porter sat down with Kelsey Hightower, Staff Developer Advocate at Google Cloud Platform (GCP), for a fireside chat on a variety of topics, including the evolution of the monitoring space, Kubernetes best practices, their opinions on an open core business model, how operators’ jobs are changing, and more. Kelsey, Sean, and Caleb discussing all things Kubernetes and open source at Sensu Summit 2018

How to install Datadog on AWS hosts with Ansible dynamic inventories

Ansible is an automation tool for provisioning, managing, and deploying infrastructure and applications. When building large-scale applications, Ansible enables users to manage and configure their infrastructure across platforms like AWS. Whether you rely on temporary or dedicated hosts, you can use Ansible to create a repeatable process for configuring them with the Datadog Agent.

How to Identify Malicious Code and Stop Web Defacement

In April of 2018, security researcher Kevin Beaumont discovered an interesting case of web defacement on the NHS Insights website. He’d expected to find data related to patient surveys about their experiences with the National Health Service. Instead, he found a very different kind of message. A review of the page’s cache suggested that this eerie music and imposing image had been in place for at least the previous five days.