Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Why Remote Work Just Works - Hear It From Our Grafanistas

Several Grafanistas talk about their remote work experience at Grafana Labs. Being remote-first enables our team to be based where they feel most productive and to ensure that work and life aren't in competition. And remote-first is *not* remote only. Grafanistas enjoy the opportunity to come together during team offsites or in shared co-working spaces. Connection is important.

Understanding How a Log Correlation Engine Enables Real-Time Insights

Tax season is notoriously most people’s least favorite time of year. For people who complete their own tax returns, the process becomes an agonizing one of looking at small pieces of paper, matching numbers to the lines that ask for information, and comparing various inputs. In essence, doing your taxes makes you a correlation engine. Now, imagine taking this tedious process and applying it to the terabytes of data that your environment generates daily.

7 Senior-Level AI Debugging Tools Compared

Every dollar spent on engineering is a bet on the future. But look at your engineering team's sprint backlog and you’ll see a non-trivial amount of that capital is spent on repairing the past. For the last ten years, if you asked a VP of Engineering what the solution was, the answer was always the same: better monitoring. Throw more telemetry at the wall. Build a bigger dashboard. Send more alerts at 3 AM. It was the only available tool, so it became the entire thesis.

Shopify Cyber Monday outage - December 1, 2025

On December 1, 2025, Cyber Monday, the biggest online shopping day of the year, Shopify suffered a widespread outage that left many merchants unable to access their stores or process orders. At a time when every minute of uptime translates directly into revenue, the disruption caused immediate concern across the ecommerce community. StatusGator detected the issue within minutes, sending an Early Warning Signal 10 minutes before Shopify published its official acknowledgement.
Sponsored Post

Adding a CDN to a load balancer (for a much faster website)

Here at Raygun, we like to go fast. Really fast. That's what we do! When we see something that isn't zooming, we try to figure out how to make it go faster. So today, we're answering a simple (and relevant) question; how do we make our public site, raygun.com, much, much faster? The answer, at first glance, is simple-we build it into a Content Delivery Network (CDN). But what if you have a load balancer serving your website, and you don't want to rebuild everything to serve from a CDN? Well, that's more complicated. Let's start by describing the issue.

What's new in the Grafana Image Renderer: higher-quality results, security enhancements, and more

Whether it’s for an email or that upcoming presentation, many Grafana users like to share their favorite dashboards or panels outside of Grafana itself. The Grafana Image Renderer is a backend service for Grafana that helps you do just that by rendering panels and dashboards as images, such as PNGs and PDFs, via a headless browser. It’s commonly used to support Grafana features like exporting dashboards, generating images for alert notifications, and creating PDF reports.

kubectl logs Command Reference and Documentation

The kubectl logs command retrieves container logs from Kubernetes pods. It supports real-time log streaming with -f, time-based filtering with --since, viewing previous container instances with --previous, and accessing logs from specific containers in multi-container pods using -c.

5 Network Issues That Affect Remote Offices (Not HQ)

Your headquarters runs flawlessly. Zero network complaints. But your remote offices? Constant connectivity problems, dropped video calls, and frustrated employees filing help desk tickets you can't solve. Remote offices experience 3x more network issues than headquarters, yet most of the IT teams have zero visibility into what's actually failing.