Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Get started with Grafana Alerting: Multi-dimensional alerts and how to route them

In this tutorial, we dig into more complex yet equally fundamental elements of Grafana Alerting: alert instances and notification policies. Don't miss the rest of the "Get started with Grafana Alerting" series! Each part dives into a different feature to help you get the most out of alerting in Grafana.

The AI Monitoring crisis that no one's talking about

When I spoke at AWS London earlier this year, I had the chance to discuss something that more and more teams are starting to feel: traditional observability doesn’t cut it for AI systems. In AI, “Is it running?” is no longer enough. We have to ask, “Is it right?” When I delivered that line, I saw the heads nodding. Everyone’s excited to build with LLMs, but when it comes to actually monitoring them in production? That’s where things fall apart.

Autonomous Operations Are Here

ScienceLogic’s vision for IT operations isn’t just about improving tools—it’s about changing the entire paradigm, flipping your day-to-day upside down. We’re moving beyond dashboards and alerts, beyond human-only workflows and rules-based systems. The future is autonomous. It’s intelligent. It’s agentic. And it’s already being realized through the power of Skylar AI.

IT Service Performance Monitoring: Key Metrics, Best Practices, and Future Trends

As organizations rely more on complex IT systems and cloud-based services, keeping everything running smoothly — and reliably — has become a top priority. That’s where IT service performance monitoring comes in, giving teams the visibility they need to make sure systems stay healthy and responsive. By tracking a range of technical and user-focused metrics, businesses can quickly identify and address issues before they impact operations or end users.

The Hidden Cost of Not Using APM in Production

Many organizations don’t realize how important it is to monitor how their applications run in production. Without Application Performance Monitoring (APM), it becomes difficult to detect and resolve issues quickly, leading to increased downtime, wasted developer effort, and poor user experience. These hidden costs, though not always visible at first, can impact customer satisfaction, reduce team efficiency, and result in lost revenue.

Will AI Speed Development in Your Legacy App?

Some people can get an AI assistant to write a day’s worth of useful code in ten minutes. Others among us can only watch it crank out hundreds of lines of crap that never works. What’s the difference? There are some skills specific to AI development. There are also properties of the codebase we’re working in that make it amenable to AI assistance. Most AI demos use projects created from scratch with AI in mind—cute.

From Sequential Bottlenecks to Concurrent Performance: Optimizing Log Processing at Scale

We optimized log processing pipeline by moving from sequential to concurrent processing at the entry level, achieving 30% higher throughput and better resource utilization without increasing infrastructure costs. When customers start sending millions of logs per minute, you quickly discover whether your processing pipeline can actually scale with vertical scaling.

VirtualMetric Earns ISO 27001:2022 Certification: Security at Every Level

We’re excited to share that VirtualMetric has officially achieved ISO 27001:2022 certification, a globally recognized standard for building and managing an effective Information Security Management System (ISMS). This confirms that we’ve implemented robust controls to protect data, manage risks, and ensure the resilience of our infrastructure in today’s security landscape.