Integration roundup: Understanding email performance with Datadog

Visibility into email health and performance is indispensable to any organization seeking to reach its customers through their inboxes. As they work to curtail spam, internet service providers (ISPs) are redefining the standards of deliverability on an ongoing basis, and organizations often struggle to adapt.

Balancing Load in Kafka: Strategies for Performance Optimization

Handling real-time data at scale? Apache Kafka is likely at the heart of your system. It’s robust, fast, and highly reliable. But as Kafka clusters grow, so does the complexity of maintaining balanced workloads across brokers and partitions. Without a solid strategy for distributing that load, you’re likely to run into bottlenecks, resource exhaustion, and consumer lag—none of which are fun to deal with. So, how do you keep your Kafka setup running efficiently and smoothly?

How to Set Up Availability Monitoring in SolarWinds Digital Experience Monitoring

Learn how to set up availability monitoring in SolarWinds Observability Platform’s Digital Experience Monitoring (DEM). This tutorial walks you through configuring synthetic probes, setting up website monitoring, adjusting test intervals, and enabling SSL certificate monitoring. Follow along to ensure your website stays available and catch issues before your users do!

Runbook Automation and Rundeck v5.6 Release Notes

The Runbook Automation and Rundeck product team is back with release v5.6, featuring security updates and fixes, along with many contributions from Rundeck’s amazing open source community. Forrest also takes us through some of the projects that community members can contribute to themselves, including the documentation and plugins.

Ask the Expert: "How Can I Put Responsible AI Into Practice?"

The ethos – and existential urgency – driving “responsible AI” policies is rooted in themes such as privacy, security, fairness, and transparency. Many headlines blur AI concepts such as machine learning, adaptive algorithms, deep learning, natural language processing, and generative AI (GenAI) technologies that are driving the current AI boom. People are worried about their data being misused, their words misconstrued, and their work misrepresented.

How the OpenTelemetry Collector Powers Data Tracing

OpenTelemetry (OTel) is an open-source observability framework that helps you collect, process, and export trace data. It's especially valuable for engineers who want to understand their systems better. At the heart of this framework lies the OpenTelemetry Collector, a pivotal component that turns raw traces into useful metrics. Let’s explore why the OpenTelemetry Collector matters and how it makes it easier for engineers to make sense of their data.

Grafana Cloud updates: The Explore apps suite for queryless data analysis, Adaptive Logs for cost optimization, and more

We consistently roll out helpful updates and fun features in Grafana Cloud, our fully managed observability platform powered by the open source Grafana LGTM Stack (Loki for logs, Grafana for visualization, Tempo for traces, and Mimir for metrics). And this month, on the heels of ObservabilityCON 2024 — our flagship observability event — we have no shortage of updates to share.

The OTTL Cookbook: Common Solutions to Data Transformation Problems

As our software complexity increases, so does our telemetry—and as our telemetry increases, it needs more and more tweaking en route to its final destination. You’ve likely needed to change an attribute, parse a log body, or touch up a metric before it landed in your backend of choice. At Honeycomb, we think the OpenTelemetry Collector is the perfect tool to handle data transformation in flight. The Collector can receive data, process it, and then export it wherever it needs to go.