Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Integration roundup: Understanding email performance with Datadog

Visibility into email health and performance is indispensable to any organization seeking to reach its customers through their inboxes. As they work to curtail spam, internet service providers (ISPs) are redefining the standards of deliverability on an ongoing basis, and organizations often struggle to adapt.

Balancing Load in Kafka: Strategies for Performance Optimization

Handling real-time data at scale? Apache Kafka is likely at the heart of your system. It’s robust, fast, and highly reliable. But as Kafka clusters grow, so does the complexity of maintaining balanced workloads across brokers and partitions. Without a solid strategy for distributing that load, you’re likely to run into bottlenecks, resource exhaustion, and consumer lag—none of which are fun to deal with. So, how do you keep your Kafka setup running efficiently and smoothly?

How the OpenTelemetry Collector Powers Data Tracing

OpenTelemetry, OTel, is an incredible open-source observability framework that helps you collect, process, and export trace data. It's super valuable for engineers who want to understand their systems better. At the heart of this framework lies the OpenTelemetry Collector, a pivotal component that turns raw traces into useful metrics. Let’s explore the importance of the OpenTelemetry Collector and show you how it makes it easier for engineers to make sense of data.

Grafana Cloud updates: The Explore apps suite for queryless data analysis, Adaptive Logs for cost optimization, and more

We consistently roll out helpful updates and fun features in Grafana Cloud, our fully managed observability platform powered by the open source Grafana LGTM Stack (Loki for logs, Grafana for visualization, Tempo for traces, and Mimir for metrics). And this month, on the heels of ObservabilityCON 2024 — our flagship observability event — we have no shortage of updates to share.

The OTTL Cookbook: Common Solutions to Data Transformation Problems

As our software complexity increases, so does our telemetry—and as our telemetry increases, it needs more and more tweaking en route to its final destination. You’ve likely needed to change an attribute, parse a log body, or touch up a metric before it landed in your backend of choice. At Honeycomb, we think the OpenTelemetry Collector is the perfect tool to handle data transformation in flight. The Collector can receive data, process it, and then export it wherever it needs to go.

How to Optimize NOC Efficiency with Operational Reports

In the fast-paced era of modern communications, staying on top of network operations is critical to ensuring optimal performance and minimizing downtimes. While networks become increasingly complex, just keeping the lights on and reacting to issues as they arise is no longer enough. Today’s network management demands not only real-time monitoring but also the ability to derive insights from comprehensive reports to provide an accurate picture of health, performance, and configuration.

The Journey to Autonomic IT: Mastering the Transition to Machine-Assisted IT

By now, you should be no stranger to Autonomic IT. The full realization of AIOps, combining AI, data, and automation to deliver a self-healing and self-optimizing IT infrastructure that operates autonomously, continuously monitoring and optimizing technology investments, and freeing up IT resources for innovation is on the horizon. In our first blog, we discussed Phase 1 of the Autonomic IT journey: Siloed IT.

Simplifying your experience: Sumo Logic's UI evolution

As organizations modernize their applications and deliver more complex, cloud-based services, the traditional boundaries between DevOps, SecOps, and ITOps are disappearing. Seamless collaboration between these teams, often referred to as DevSecOps, has become essential for efficiently addressing both reliability and security challenges.