The latest News and Information on Log Management, Log Analytics and related technologies.

Why we open-sourced AURA: Infrastructure for production AI

Over the last year, I’ve talked to dozens of SRE teams about AI. The excitement is real, but conversations hit a wall when we get to production reality. How does an agent manage complex context without losing the plot? How does it avoid hallucinating relationships between signals? Who owns the orchestration logic that ties it all together? We realized the bottleneck wasn’t model intelligence. It was the lack of a reliable logic layer between the data and the model.
Sponsored Post

What is a Real-Time Data Lake?

A data lake is a centralized data repository where structured, semi-structured, and unstructured data from a variety of sources can be stored in its raw format. Data lakes help eliminate data silos by acting as a single landing zone for data from multiple sources. But what's the difference between a traditional data lake and a real-time data lake? Some traditional data lakes use batch processing, which means processing and analyzing a collection of data that has been stored over a specific timeframe. For example, payroll and billing systems that run on a weekly or monthly cycle might use batch processing.
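To make the contrast concrete, here is a minimal sketch in Python. It is not taken from the article: the `(account, amount)` event shape and both function names are hypothetical, chosen only for illustration. The batch version waits for the whole stored collection and returns one result; the incremental version emits an updated total as each event arrives, which is roughly what "real-time" implies.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, Tuple

# Hypothetical event shape for illustration: (account, amount).
Event = Tuple[str, float]


def batch_totals(stored_events: Iterable[Event]) -> Dict[str, float]:
    """Batch style: process a whole stored collection, return one result."""
    totals: Dict[str, float] = defaultdict(float)
    for account, amount in stored_events:
        totals[account] += amount
    return dict(totals)


def streaming_totals(event_stream: Iterable[Event]) -> Iterator[Event]:
    """Incremental style: yield the running total as each event arrives."""
    totals: Dict[str, float] = defaultdict(float)
    for account, amount in event_stream:
        totals[account] += amount
        yield account, totals[account]


events = [("a", 10.0), ("b", 5.0), ("a", 2.5)]
print(batch_totals(events))
for update in streaming_totals(events):
    print(update)
```

Both functions do the same arithmetic; the difference is when results become visible. A real system would read from a message queue or object store rather than an in-memory list.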

Your Data is Whispering and Needs a Human to Listen

If you have ever owned, operated, or supported a piece of technology, you have probably built a dashboard. Maybe it started as a quick chart to answer a simple question, then quietly grew into something more important. Dashboards are often created by the people who know the systems best, the ones who can wire together data sources and click all the right buttons. But those same builders are rarely trained in how humans actually interpret data.

AI can do what now?! What an ethical hacker says about deepfakes and AI

Real-time camera deepfakes are no longer science fiction. High-fidelity, AI-generated impersonation may be advancing quickly — but that's not the only AI risk financial services companies should be thinking about. In this episode of AI Can Do What Now?!, Lisa Jones-Huff, director of security solutions architecture at Elastic, sits down with ethical hacker Freakyclown (FC) to explore what is technically possible today with AI, where reality still falls short of the hype, and what security teams should be worried about.

AI can do what now?! The real risks of AI in social engineering

What is the most immediate risk financial services companies face today? AI-enabled social engineering is already accelerating real-world attacks. Scale, personalization, speed, and automation are lowering the barrier for attackers while making fraud detection more complex for defenders. In this episode of AI Can Do What Now?!, Lisa Jones-Huff, director of security solutions architecture at Elastic, is joined by ethical hacker Freakyclown (FC) and principal solutions architect Joe Murin to explore what is actually happening right now, beyond the hype.

Build a Unified Operational Ecosystem with ServiceNow and Coralogix

During high-priority incidents, SRE teams frequently lose critical time switching between monitoring platforms and ticketing systems. Context switching like this forces engineers to manually update incident states by copying and pasting data. The inevitable result is increased risk of information gaps and slower Mean Time to Recovery (MTTR).

Powering Security Innovation: Executive Q&A on Splunk Joining AWS Security Hub Extended

To succeed in the AI era, customers need fast, easy access to security solutions that can harness the power of agentic AI and deliver business outcomes. They need seamless access to their data for faster threat detection, simpler incident response, and reduced risk. They need technology vendors to work together and not in silos.

Colsubsidio transforms business process monitoring with Elastic Observability

Colsubsidio is one of the largest and most representative family compensation funds in Colombia. The organization manages and delivers essential social services to millions of users through a broad network spanning health, education, subsidies, recreation, tourism, credit, housing, pharmacies, retail supply, culture, and labor welfare.

Claude Code + OpenTelemetry: Per-Session Cost and Token Tracking

I was looking at our Claude Code spend in the Anthropic console the other day. Aggregate cost, aggregate tokens, but no breakdown by developer and no breakdown by session. I knew my hackathon team had been using it heavily to build out new features for the OpenTelemetry Distro Builder. But heavily how? I had no idea. Turns out Claude Code has been emitting OpenTelemetry signals the whole time: per-session cost, token counts, and every tool call it makes on your codebase.
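As a rough illustration of the per-session breakdown described above, the sketch below rolls already-exported data points up by session. Everything about the input is an assumption: the `session_id`, `cost_usd`, and `tokens` keys are hypothetical placeholders, not the actual attribute or metric names Claude Code emits, so map them to whatever your telemetry backend returns.

```python
from collections import defaultdict
from typing import Dict, Iterable, Mapping, Union

Number = Union[int, float]


def summarize_sessions(
    records: Iterable[Mapping[str, object]],
) -> Dict[str, Dict[str, Number]]:
    """Sum cost and token counts per session.

    Each record is assumed (hypothetically) to carry the keys
    'session_id', 'cost_usd', and 'tokens'.
    """
    summary: Dict[str, Dict[str, Number]] = defaultdict(
        lambda: {"cost_usd": 0.0, "tokens": 0}
    )
    for record in records:
        session = summary[str(record["session_id"])]
        session["cost_usd"] += float(record["cost_usd"])  # type: ignore[arg-type]
        session["tokens"] += int(record["tokens"])  # type: ignore[call-overload]
    return dict(summary)


# Made-up sample data, purely for illustration.
sample = [
    {"session_id": "s1", "cost_usd": 0.25, "tokens": 1000},
    {"session_id": "s2", "cost_usd": 0.5, "tokens": 400},
    {"session_id": "s1", "cost_usd": 0.5, "tokens": 2000},
]
for session_id, totals in summarize_sessions(sample).items():
    print(session_id, totals)
```

In practice a metrics backend would usually do this grouping with a query; the function only shows the shape of the rollup.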