Operations | Monitoring | ITSM | DevOps | Cloud

%term

Tinder & Grafana: A Love Story in Metrics and Monitoring

Tinder is the world’s most popular dating app, with more than 26 million matches made each day. But two years ago, when it was time for the L.A.-based company to find and implement a perfect metrics monitoring partner, the process proved to be more slow-burn love affair versus whirlwind romance.

Monitoring Kubernetes Clusters on GKE (Google Container Engine)

The Kubernetes ecosystem contains a number of logging and monitoring solutions. These tools address monitoring and logging at different layers in the Kubernetes Engine stack. This document describes some of these tools, what layer of the stack they address, as well as best practices for implementation including an example from the field, a quick start, and a demo project.

Downsampling and Exporting Stackdriver Monitoring Data

Stackdriver Monitoring contains a wealth of information about cloud resource usage, both for Google Cloud Platform (GCP) and and other sources. This post will explain how to use the Stackdriver Monitoring API to read, downsample, and export data from Stackdriver to BigQuery. Pub/Sub metrics will be used to demonstrate this.

Why Your Website Needs Uptime Monitoring in 2019

When a visitor types your website into the search bar, they intend to see your website and not a broken link/webpage. In a digital world where we want everything right now, a website that is not available right now is a website we probably will not go back to in the future. That much should be obvious to everyone. If you are not careful, your website could be sending your visitors to your competitors. So how do you keep track of your website 24/7? One answer is to use an uptime monitoring service.

Why is the Application Slow? Prove It's NOT the Network!

The one complaint that an IT administrator dreads to receive is one where an end user says, “My application is slow!”. The application in question can be a web application, an enterprise application like SAP, Microsoft SharePoint, or a SaaS application like Salesforce or Office 365. Since the application is accessed over a network, it’s natural that the network team is pulled up first under the suspicion that it’s a network issue.