The latest news and information on monitoring for websites, applications, APIs, infrastructure, and other technologies.

Sysdig Team - What does good collaboration look like?

In this video, our team shares how we work together to move fast, stay aligned, and build impact across engineering, product, design, marketing, and beyond. You’ll hear honest perspectives on: Whether you're part of Sysdig or just curious how high-performing teams operate, this behind-the-scenes look highlights the mindset and culture that power everything we do.

OpenTelemetry Java Agent for Spring Boot: Complete Setup Guide

The OpenTelemetry Java Agent provides zero-code instrumentation for Spring Boot applications through bytecode manipulation. This guide covers setup, configuration, auto-instrumentation capabilities, and production deployment strategies for implementing distributed tracing and observability.

Understand, diagnose, and optimize SQL queries: Introducing Grafana Cloud Database Observability

It’s widely acknowledged that most application performance problems stem not from the application itself, but from the underlying database. Slow or inefficient database queries are often the primary cause of these issues, acting as the biggest driver of application performance incidents. If you’ve been troubleshooting slow API calls or sluggish services, chances are the root cause resides within your database layer.

Network Monitoring vs. Network Observability: What Do You Need?

A decade ago, network monitoring was straightforward. You had a data center, some branch offices, MPLS circuits connecting everything, and a handful of applications running on-premises. Set some SNMP thresholds, configure a few alerts, and you were covered. When something broke, the problem was usually obvious: a failed switch, a saturated link, a misconfigured router. Today's networks bear zero resemblance to that world.

Introducing the Splunk Technology Add-on for Ollama: Illuminating Shadow AI Deployments

Without strong visibility and governance, local LLMs risk replicating the fragmented, unsupervised sprawl once seen in shadow IT, complicating security postures and making it difficult for organizations to ensure proper oversight and compliance as these powerful AI tools become embedded in daily workflows. To address this challenge, the Splunk Threat Research Team has released the Splunk Technology Add-on for Ollama, which provides comprehensive monitoring and observability capabilities specifically designed for local LLM deployments.

Your NOC's Most Important New Skill? Ignoring Things

I want to challenge a deeply held belief in our industry, one that I once championed myself: the idea that more data is the answer. We've spent a fortune building vast data lakes of network telemetry, believing that if we could just collect everything, we would achieve a state of operational nirvana.

The Seven Wastes of Network Operations

Does it ever feel like your network operations team is constantly running, yet always struggling to keep up? The ticket queues are long, troubleshooting is a complex detective story, and every new application deployment adds another layer of anxiety. This constant state of reactive firefighting isn't a sign of a bad team; it's the symptom of a broken process. This operational friction, the invisible tax on every action your team takes, has a name: waste.