Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Continual Learning in AI: How It Works & Why AI Needs It

Like humans, machines need to continually learn from non-stationary information streams. While this is a natural skill for humans, it’s challenging for neural networks-based AI machines. One inherent problem in artificial neural networks is the phenomenon of catastrophic forgetting. Deep learning researchers are working extensively to solve this problem in their pursuit of AI agents that can continually learn like humans.

Advantages of an AI-Powered Observability Pipeline

The expenses associated with collecting, storing, indexing, and analyzing data have become a considerable challenge for organizations. This data is growing as fast as 35% a year, multiplying the problems. This surge in data comes with a corresponding rise in infrastructure costs. These costs often force organizations to make decisions about what data they can afford to analyze, which tools they must use, and how and where to store data for long-term retention.

Splunk second thoughts? It's time for the cloud-native alternative

Back in September when Cisco announced they were acquiring Splunk, we explained how the market was consolidating with Sumo Logic ahead of the pack, challenging traditional vendors with our cloud-native platform. Now that the deal is complete and Splunk is officially a Cisco company, we’re hearing from more Splunk customers who are considering their options.

What is Log Analytics?

There is observation then there’s analysis. Log Analytics falls under the latter category. Observation and analysis are not mutually exclusive; one builds upon the other. Similarly, Log analytics advances beyond simple log monitoring, enabling observability teams to identify trends and irregularities throughout your enterprise. To demystify what is Log Analytics, let’s first have a look at the definition.

Elastic Search 8.13: Simplifying embedding and ranking for developers

Elastic Search 8.13 extends the capabilities that enable developers to use artificial intelligence and machine learning models to create fast and elevated search experiences. Integrated with Apache Lucene 9.10, measured vector search performance has exceeded 2x in benchmarks, extending the sophistication of searches that can be performed in near real time.

The Ultimate CPU Alert - Reloaded, Again!

It’s been nearly ten years since “The Ultimate CPU Alert – Reloaded” and its Linux version were shared with the SolarWinds community. At that time, managing CPU data from 11,000 nodes, with updates every five minutes to a central MSSQL database, was a significant challenge. The goal was to develop alerting logic to identify when a server was experiencing high CPU usage accurately.

Turning Logs into Metrics with OpenTelemetry and BindPlane OP

Turning logs into metrics isn’t a new concept. A version of this functionality is implemented in most agents, visualization tools, and backends. It’s everywhere because converting logs to metrics has many practical applications and is one of the fundamental mechanisms for controlling log volume in a telemetry pipeline. In this post, I’ll briefly overview log-based metrics, explain why they matter, and provide examples of how to build them using OpenTelemetry and BindPlane OP.

Webinar Recap: Myths and Realities in Telemetry Data Handling

Telemetry data is growing exponentially, but the business value isn’t increasing at a similar pace. Getting the right telemetry data is hard, so I recently had a conversation with Matt Aslett, Director of Research at Ventana Research, now a part of ISG, about five myths and realities in telemetry data handling.

Mastering Log Retention Policy: A Guide to Securing Your Data

The strategic implementation of a security log retention policy is critical for safeguarding digital assets and key company data. This practice is foundational for detecting and analyzing security threats in real-time and conducting thorough post-event investigations. Integrating the nuances of log analytics system costs, which escalate with data volume due to the infrastructure needed for storage and processing, highlights a critical aspect of security log retention.

Maximize IT efficiency leveraging alert management with Elastic AI Assistant for Observability

Manage and correlate signals and alerts in Elastic Observability As organizations embrace increasingly complex and interconnected IT systems, the sheer volume of alerts generated by diverse monitoring tools has given rise to a critical challenge — how do we efficiently sift through the noise to identify and respond to the most crucial issues? Event management and correlation are two indispensable pillars in the realm of IT service management.