The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Managing observability spend with Grafana Cloud's Cost Management Hub

Learn how Grafana Cloud helps you analyze, manage, and optimize observability spend from a central location called the Cost Management Hub. The move to cloud-native technologies like Kubernetes and Prometheus has caused an unprecedented increase in telemetry data, sending observability bills skyrocketing. With Grafana Cloud and the central Cost Management Hub, you get the tools to inspect, attribute, optimize, and monitor your observability spend, and to answer cost-related questions as they come up.

ObservabilityCON 2023 - Opening Keynote (Live)

👋 Coming to you live from London, Grafana ObservabilityCON 2023's keynote introduces the latest developments in the open and composable LGTM (Loki, Grafana, Tempo, Mimir) observability stack AND many exciting announcements! Our keynote features CEO/Co-founder Raj Dutt, CTO Tom Wilkie, and members of the Grafana Labs engineering team.

Driving Efficiency and Advanced Insight - Explore Virtana's Latest Innovations

Virtana is proud to announce a series of new features that give our customers advanced AI-driven capabilities, enhanced user interfaces, and deeper integrations, all aimed at optimizing application and infrastructure observability. Let’s dive into these innovations and see how they change the way IT professionals interact with their environments.

The Future of Operations: AI-powered Internet Performance Monitoring

At Catchpoint, our philosophy is that AI should not be adopted simply for the sake of AI itself. Instead, it should be embraced when it proves to be the most effective solution for addressing a particular business challenge. While the world is currently in the fervor of the oncoming AI revolution, our industry-leading IPM platform has quietly harnessed the potential of Artificial Intelligence for years.

Java Application Monitoring - How IT Ops can Diagnose Memory Leaks at Scale

Many server-side applications are written in Java and often process tens of millions of requests per day. Key applications in various domains like finance, healthcare, insurance and education are often Java-based. When these applications slow down or fail, they affect the user experience and in turn, reduce business revenue. Behind many web forms or form-like GUIs there will often be a Java application.

How To Investigate a Reported Problem

Getting to the root cause of a problem in cloud-native environments requires engineers to navigate immense complexity within a distributed system. Oftentimes, you didn’t write the code, and you lack the background and context to quickly understand what’s going on when a problem occurs. The stakes are even higher when a problem is reported, meaning it has already started to impact the business, and neither executives nor customers are pleased.

KubeCon North America 2023 event recap

As autumn graced the vibrant city of Chicago, I had the distinct opportunity to immerse myself in the heart of innovation and camaraderie at the CNCF’s KubeCon North America conference. Over the span of four remarkable days, from Nov 6-9, I was fortunate enough to walk alongside the many enthusiasts, contributors and organizers of open source and cloud native communities.

From Oops to Ops: SLOs Get Budget Rate Alerts

As someone who has been living the Honeycomb ops life for a while, I have found SLOs to be the bread and butter of our most critical and useful alerting. However, they had severe, long-standing limitations. In this post, I will describe these limitations and how our brand new feature, budget rate alerts, addresses them. We usually don’t have SREs writing product announcements, but I’m so excited about this one that I said, “Screw it, I’m doing it!”