Operations | Monitoring | ITSM | DevOps | Cloud

Observability trends in Japan: Insights from Grafana Labs' latest survey

Japanese organizations are focused on controlling costs and limiting complexity—and they might be getting ready to broaden their adoption at just the right time, according to analysis of a micro survey on observability recently conducted by Grafana Labs. Observability is an evolving space in Japan, and this is the first time Grafana Labs has run a Japanese version of our annual Observability Survey.

Invisible dependencies, visible impact: Lessons from the Google Cloud outage

June 12, 2025. A date most of the Internet won’t remember — but anyone relying on Google Cloud will. In the span of minutes, a routine quota update snowballed into global disruption. APIs stopped responding. Dashboards stayed green. And across continents, teams scrambled to figure out if the problem was theirs — or Google's. It wasn’t a cyberattack. It wasn’t a datacenter fire.

Our Golang Stack in 2025

In our Go projects, we rely on a consistent and battle-tested stack of libraries that help us build reliable, maintainable, and scalable systems. We started using Go in our stack many years ago (before Go v1) and therefore many of our choices have changed over the years. Here in this post, I wanted to share some of the libraries we use regularly to power our Go apps.

How to Use an SLA Uptime Calculator to Understand Service Availability

TL;DR A Service Level Agreement (SLA) defines the required uptime for a service. An SLA uptime calculator helps convert uptime percentages into actual allowed downtime across different timeframes. This guide explains how these calculators work, why uptime matters, and how to monitor performance to meet SLA targets.

OpenTelemetry for Go: measuring the overhead

Everything comes at a cost — and observability is no exception. When we add metrics, logging, or distributed tracing to our applications, it helps us understand what’s going on with performance and key UX metrics like success rate and latency. But what’s the cost? I’m not talking about the price of observability tools here, I mean the instrumentation overhead.

Everything You Need to Know About Event Logs

Your code passes locally, CI is green, and the deploy goes through. Then production throws a 500, and the trace isn’t helpful. And here, event logs help. A log captures timestamped records of what the app did HTTP requests, DB queries, cache misses, retries, failures. These entries give you enough context to debug without reproducing the issue locally. Especially when dealing with distributed systems, logs are often the only consistent source of truth.

A guide to PHP exception handling

In most object-oriented languages, exceptions are an extremely powerful mechanism for dealing with unexpected situations that arise when running your code. PHP has supported robust exception handling since PHP 7.0. As you begin your programming journey, exceptions are a source of tremendous pain. Over time, you grow to appreciate the value they bring.

Thinking of a Career in Inclusion? Here's What You Need to Know

The modern workplace is changing. Inclusion is no longer just a buzzword, it's a foundation for growth, equity, and innovation in organizations around the world. Companies are recognizing that fostering a sense of belonging leads to better collaboration, increased retention, and stronger business outcomes. As a result, careers in inclusion and diversity are gaining momentum across a wide range of industries.

Which AI Service Providers Specialize in Modernizing Business Processes for Large Enterprises?

AI is on every business leader's lips for good reason. This game-changer can help any organization beat the competition and stay ahead of the curve. However, embracing AI is one thing, and implementing it optimally is another. You should welcome a third-party consultant early in your AI adoption journey to determine the most effective strategies to maximize it for modernizing business processes.

How AI is Helping Businesses Streamline Website Design Processes

Website design tools powered by artificial intelligence (AI) are redefining digital presence by streamlining development and enabling companies to build stunning websites. These sophisticated technologies let non-designers create professional websites by means of machine learning algorithms optimizing layouts, suggesting color palettes, and knowing consumer preferences. Using content organization and image selection, these technologies help businesses concentrate on building a strong digital presence and enhancing user experiences.