Operations | Monitoring | ITSM | DevOps | Cloud

xMatters

xMatters service reliability platform helps DevOps, SREs, and operations teams automate workflows, ensure infrastructure and applications are always working, and rapidly deliver products at scale.

Best Practices Guide for Kubernetes Labels and Annotations

Kubernetes is the de facto container-management technology in the cloud world due to its scalability and reliability. It also provides a very flexible and developer-friendly API, which is the foundation of its control plane. The effectiveness of the Kubernetes API comes from how it manages the Kubernetes resources via metadata: labels and annotations. Metadata is essential for grouping resources, redirecting requests and managing deployments.

Resolve AWS Lambda function failures faster by monitoring invocation payloads

In a serverless application, AWS Lambda functions are typically invoked by JSON-formatted events from other AWS services—like API Gateway, S3, and DynamoDB—and respond with JSON-formatted payloads. Having visibility into these function request and response payloads can provide context around your function invocations and help you uncover the root causes of Lambda function failures.

Making data accessible with sound, a Grafana Labs Hackathon project by Kostas Pelelis

We learned from a visually impaired astronomer that it was possible to use sonification to understand astronomical spectra. So during a hackathon at Grafana Labs we decided to turn time series into audio, and add sound to our alerting systems too. Kostas Pelelis is a Software Engineer at Grafana Labs living in Greece.

JFrog and Upswift: Bringing IoT Software Updates to DevOps Upswift Acquistion

JFrog has acquired Upswift to bring the world of connected devices into the DevOps pipeline! Managing fleets of devices and edge applications remotely - including over-the-air (OTA) updates, security, monitoring, controlling and more - has quickly become unwieldy for most companies, with growth of connected devices expected to reach 24 billion in 2026. But, most of today’s DevOps solutions are not optimized or built to deliver software updates to distributed edge and IoT environments.

Bootstrapping a multi DC cloud native observability stack by Bram Vogelaar

An introduction to Observability and how to setup a highly available monitoring platform, across multiple data centers. During this talk we investigate how to config a monitoring setup across 2 DCs using Prometheus, Loki, Tempo, Alertmanager and Grafana. Bram Vogelaar spent the first part of his career as a Molecular Biologist, he then moved on to supporting his peers by building tools and platforms for them with a lot of Open Source technologies. He now works as a DevOps Cloud Engineer at The Factory.

Testing with k6 + Grafana: Better together by Nicole van der Hoeven

k6 is one of the newest additions to the Grafana Labs family, but what exactly is it, and what does it have to do with Grafana? In this session, Nicole van der Hoeven will demonstrate how to use k6, how to integrate it with Grafana, and why k6 and Grafana are better together. Nicole van der Hoeven is a Developer Advocate at k6, living in the Netherlands.

Tales of A11y In Grafana OS: Introducing Pa11y CI into our pipeline by Alexa Vargas

We want to make Grafana accessible to everyone! In this talk, Alexa will share how Grafana recently introduced Pa11y CI into the Grafana Continuous Integration pipeline. The library supports our developers and contributors to highlight a11y issues. And more importantly, it acts as a gatekeeper, stopping new A11y issues from making it into the project. You will additionally hear about the alternatives that were considered and their challenges. This talk will have everything!