
The latest News and Information on Log Management, Log Analytics and related technologies.

From Downtime to Uptime: Monitoring Tools and Techniques for Systems, Websites, APIs, and More

Recently, while visiting a friend in a local hospital, I found myself facing a frustrating distraction: trying to pay parking fees using USSD (a mobile text-based system for quick transactions). The service was either painfully slow or not working at all. I wasn’t alone. Other visitors were just as exasperated, and parking attendants stood idle, their handheld devices frozen in endless loading loops.

Top Features of Splunk Observability Cloud for Engineers

In this video we’ll walk you through a demonstration of Splunk Observability Cloud’s key capabilities. You’ll see how you can monitor Kubernetes cluster health in Infrastructure Monitoring, and alert on your services’ health using AutoDetect Detectors and Alerts. We’ll then take a look at traces and metrics in APM, and use Related Content to find log entries correlated with error traces. Finally, we’ll use AlwaysOn Profiling to troubleshoot long-duration traces for our service.

Shift-Left Monitoring for GitHub and Vercel Workflows

A recent LinkedIn poll by Peter Zaitsev asked: “What is the most common preventable cause of downtime in your environment?” Guess what most respondents said it was? Surprise, surprise – the top answer was Deploying Broken Code, with 57% of respondents selecting it. This reinforces how critical it is to catch issues before they hit production.

How to Monitor Frontend Memory Usage

First of all, by frontend memory usage I mean the amount of memory that a user’s browser needs when using your website or webapp. Secondly, do you have any idea how much browser memory your website or webapp requires? Or do you know if or how much the memory footprint of your website/webapp has changed over the last few months? Or after the recent changes or releases you made? I’m guessing you don’t. Yet, this is important to monitor to avoid a bad user experience.

Solve your MTTR mysteries faster with Sumo Logic

Picture this: a crime scene where the evidence is scattered across five different rooms. There’s a footprint in one, a shattered window in another, a stray shoe on the stairs, and a witness across the street who only saw part of what happened. Each clue matters in solving the case, but none of them tells the full story on its own.

Why Cribl Copilot Editor is Built for the Human, First and Foremost

I’m genuinely excited about what we’re rolling out with Copilot Editor, an update to our AI that’s truly packed with new capabilities designed to help you automate pipeline development. You can read about these capabilities here. I wanted to take a moment to share our thinking on a core principle that guides how we build, especially regarding the impactful, and sometimes daunting, world of generative AI.

Blueprints: Ready-Made Processor Bundles For Your Telemetry Pipelines

We’ve noticed that a lot of our customers spend countless hours building and configuring processors. Whether it’s parsing JSON, standardizing log formats, normalizing timestamps, masking PII, or de-duplicating logs, the list never ends. Most of the work revolves around recreating the same processor bundles in multiple processor nodes. Bindplane’s new Blueprints feature removes that boring, repetitive work by providing pre-built processor bundles you can drop into any pipeline with a single click.
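To make the idea of a reusable processor bundle concrete, here is a minimal Python sketch of a few of the steps named above (JSON parsing, timestamp normalization, PII masking, de-duplication) chained into one ordered list. It is a generic illustration only, not Bindplane's API or Blueprint format: the function names, the `ts` and `message` field names, and the email-only masking rule are all assumptions made for the example.

```python
import hashlib
import json
import re
from datetime import datetime, timezone

# Assumption: "PII" here means email addresses only; real rules would be broader.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def parse_json(record):
    """Parse a raw JSON log line into a dict (dicts pass through unchanged)."""
    return json.loads(record) if isinstance(record, str) else record


def normalize_timestamp(record):
    """Convert an epoch-seconds 'ts' field (hypothetical name) to ISO 8601 UTC."""
    ts = record.get("ts")
    if isinstance(ts, (int, float)):
        record["ts"] = datetime.fromtimestamp(ts, tz=timezone.utc).isoformat()
    return record


def mask_pii(record):
    """Replace email addresses in the 'message' field with a placeholder."""
    if isinstance(record.get("message"), str):
        record["message"] = EMAIL_RE.sub("[REDACTED]", record["message"])
    return record


def run_bundle(raw_lines, processors):
    """Apply each processor in order, then drop exact duplicate records."""
    seen = set()
    out = []
    for line in raw_lines:
        record = line
        for proc in processors:
            record = proc(record)
        # Hash the canonical JSON form so key order does not affect de-duplication.
        key = hashlib.sha256(json.dumps(record, sort_keys=True).encode()).hexdigest()
        if key not in seen:
            seen.add(key)
            out.append(record)
    return out


# The "bundle" is just an ordered list of processors that can be reused as a unit.
BUNDLE = [parse_json, normalize_timestamp, mask_pii]
```

Calling `run_bundle(lines, BUNDLE)` on a list of raw JSON log lines returns the cleaned, de-duplicated records. Keeping the bundle as a plain list is one simple way to reuse the same steps across pipelines instead of rebuilding them each time.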

Scaling Observability: How We Designed Bindplane to Manage 1,000,000 OpenTelemetry Collectors

Join the live stream at 11 am ET, here. Platform teams tend to start with just one or, in some cases, a handful of OpenTelemetry (OTel) Collectors, usually running in gateway mode. They then embrace the benefits of a vendor-neutral, standardized telemetry collector for unified logs, metrics, and traces.