Custom OpenTelemetry Collectors: Build, Run, and Manage at Scale

I tried to think back to the last time I read an actual tutorial that did not include a bunch of em dashes (—), semicolons, normal dashes, and an unnervingly large number of phrases like “XYZ-thing Alert” and “Exciting News!”. Well, hold on to your suspenders, folks, here we go again. Part 2 is up and it’s a controversial one.

OpenTelemetry Deep Dive: Resilience & High Availability in the OTel Collector

Missed it live? Catch the full recording of OpenTelemetry Deep Dive: Resilience & High Availability in the OTel Collector — a 1-hour workshop on building telemetry pipelines that never drop a signal. We’ll show you why resilience matters, how to design high-availability architectures, and how to configure the OpenTelemetry Collector with retries, batching, and persistent queues. Plus, you’ll see live demos in both Docker and Kubernetes — including scaling Gateway collectors with an HPA — and how Bindplane makes large-scale management seamless.

Kafka Performance Crisis: How We Scaled OpenTelemetry Log Ingestion by 150%

When your telemetry pipeline starts falling behind, the countdown to production impact has already begun. One Bindplane customer operating a large-scale log ingestion pipeline built on the OpenTelemetry Collector and Kafka hit that breaking point. Instead of keeping pace with incoming data, their pipeline was ingesting just 12,000 events per second (EPS) per partition/collector—and this Kafka topic had 16 partitions. In aggregate, that was roughly 192K EPS.

Resilience with Zero Data Loss in High-Volume Telemetry Pipelines with OpenTelemetry and Bindplane

One Bindplane customer faced exactly this challenge, zero data loss at high volume, while processing enormous log files stored in S3. Our engineering team tackled the problem head-on, adding offset tracking to the S3 event receiver and applying chaos testing methodologies.

How to Build Resilient Telemetry Pipelines with the OpenTelemetry Collector: High Availability and Gateway Architecture

Let’s bring that back. Today you’ll learn how to configure high availability for the OpenTelemetry Collector so you don’t lose telemetry during node failures, rolling upgrades, or traffic spikes. The guide covers both Docker and Kubernetes, with hands-on demos of sample configs. But first, let’s lay some groundwork.

Bindplane Recommendation Engine: Automatically Improve Telemetry Parsing #opentelemetry #collector

Check out the full @bindplane community call from June. See how Bindplane instantly suggests improvements using its recommendation engine. This video explores how to automatically parse severity with default values, enhancing data analysis efficiency. Learn how to quickly optimize your setup.

Blueprints Are Pre-Built Processor Bundles #opentelemetry #collector #observability

Check out the full @bindplane community call from June. Here we explore automated JSON parsing using a JSON Parse processor bundle that was added with the Blueprint feature in Bindplane. Learn how to parse JSON strings, extract fields, and set accurate timestamps without having to add any custom configs. Bindplane handles all the heavy lifting automatically.

Regex Log Parsing Made Easy with AI/LLM Support #opentelemetry #collector #observability

Check out the full @bindplane community call from June. We explore the Apache HTTP source and the new AI regex log parsing capabilities. We leverage a Bindplane processor for complex pattern matching, enabling efficient data processing. This guide demonstrates how to easily generate and apply regex patterns with AI support.