Operations | Monitoring | ITSM | DevOps | Cloud

October 2022

Reduce MTTR and improve UX with Grafana Enterprise: Inside Optum's observability stack

Among the 12 greatest stressors in life, six revolve around healthcare issues. From loss of a loved one to pregnancy and even retirement, these events often involve interactions with healthcare services — interactions that can either add to an individual’s stress or, ideally, help alleviate it.

How to monitor the health and resource usage of Kubernetes nodes in Grafana Cloud

The spine is essential to perform every activity, like crawling, walking, or swimming. Just as the spine is necessary to enable these functions, your Kubernetes infrastructure needs a backbone to be efficient and effective. So if Kubernetes clusters act as the spine of your architecture, then Kubernetes nodes are like the vertebrae — they make up a Kubernetes cluster in the same way the vertebrae form the spinal column.

Grafana and Cilium: Deep eBPF-powered observability for Kubernetes and cloud native infrastructure

Today, Grafana Labs announced a strategic partnership with Isovalent, the creators of Cilium, to make it easy for platform and application teams to gain deep insights into the connectivity, security, and performance of the applications running on Kubernetes by leveraging the Grafana open source observability stack.

How to manage high cardinality metrics in Prometheus and Kubernetes

Over the last few months, a common and recurring theme in our conversations with users has been about managing observability costs, which is increasing at a rate faster than the footprint of the applications and infrastructure being monitored. As enterprises lean into cloud native architectures and the popularity of Prometheus continues to grow, it is not surprising that metrics cardinality (a cartesian combination of metrics and labels) also grows.

How to autoscale Grafana Loki queries using KEDA

Grafana Loki is Grafana Labs’ open source log aggregation system inspired by Prometheus. Loki is horizontally scalable, highly available, and multi-tenant. In addition, Grafana Cloud Logs is our fully managed, lightweight, and cost-effective log aggregation system based on Grafana Loki, with free and paid options for individuals, teams, and large enterprises.

Get better insights from industrial IoT data with Grafana

Varland Plating has been in the electroplating business since 1946. At their industrial job shop in Cincinnati, Ohio, they perform complex electrochemical treatments on steel, brass, and copper manufactured parts to create everything from corrosion-resistant building materials to decorative metals.

Grafana 9.2 release: Troubleshooting Grafana panels with a new support feature

Ever run into issues building a panel in your Grafana dashboards? To help with those issues, the current support process for Grafana, Grafana Cloud, and Grafana Enterprise often requires many cycles where we request more information. This can be slow, frustrating for both our users and our support teams, and the process makes it difficult to reproduce issues without access to similar data.

Monitoring HPC system health with Grafana and Psychart

Nicolas Ventura is a critical facilities engineer at NERSC, with experience in both mechanical and computer systems. The National Energy Research Scientific Computing Center (NERSC) is a modern data center that’s home to two powerful high-performance computing (HPC) systems used for worldwide scientific research in genetics, physics, geology, and more. As such, the infrastructure team at NERSC has to closely track the facility conditions to ensure optimal operations.

5 key benefits of Kubernetes monitoring

Kubernetes made it much easier to deploy and scale containerized applications, but it also introduced new challenges for IT teams trying to keep tabs on these newly distributed systems. Ops teams need proper visibility into their Kubernetes clusters so they can track performance metrics, audit changes to deployed environments, and retrieve logs that help debug application crashes.

Grafana 9.2 release: New Grafana panel help options, Grafana oauth updates, simplified variable editor for Grafana Loki, and more!

Welcome to Grafana 9.2, a jam-packed minor release with a wide range of improvements to help you create and share Grafana dashboards and alerts. Along with new developments for public dashboards and support for Google Analytics 4 properties, Grafana 9.2 offers new ways to connect with support teams about panel issues, a simplified query variable editor for Grafana Loki, improvements to access control, and much more.

Grafana k6 one year later: Lessons learned after an acquisition

A few years ago, I was meeting with venture capitalists and private equity firms about the future of k6, the open source performance testing tool that we created in 2016 and open sourced in 2017. After talking about the k6 product mission — to give modern engineering teams better tools to build reliable applications — one investor challenged us to create an even bigger vision for the company: What if we acquired a company to broaden the k6 story?

Announcing Grafana Cloud Link, a gateway from any local Grafana instance to Grafana Cloud

If you’ve had a local Grafana instance for any length of time, it’s likely dialed in just how you like it, and that’s a good thing. If you are working within Grafana Cloud, by contrast, you are using a heavily opinionated experience that our teams are building, managing, and provisioning. As a result, we serve up solutions that users can work with out of the box and can use to build their stack.

Set up instant SNMP monitoring with the new SNMP integration in Grafana Cloud

Simple Network Management Protocol (SNMP) is an internet protocol that is used to collect information about network devices and manage them. Most of the modern devices connected to a network support SNMP, such as routers, switches, servers, printers, and more. There are three different versions of SNMP (v1, v2, and v3). It most commonly operates on UDP ports 161 and 162. The most common versions being used are v1 and v2. The data can be collected from a network device through SNMP via polling.