Operations | Monitoring | ITSM | DevOps | Cloud

Achieving Great Dynamic Sampling with Refinery

Refinery, Honeycomb’s tail-based dynamic sampling proxy, often makes sampling feel like magic. This applies especially to dynamic sampling, because it ensures that interesting and unique traffic is kept, while tossing out nearly-identical “boring” traffic. But like any sufficiently advanced technology, it can feel a bit counterintuitive to wield correctly, at first. On Honeycomb’s Customer Architect team, we’re often asked to assist customers with their Refinery clusters.

Automated Employee Onboarding: The Gamechanger for New Hires and IT Teams

Too many IT tickets, not enough time. That’s just one problem that comes with a poor employee onboarding experience, and it’s one that causes deal-breaking difficulties for new employees, and just as importantly, IT leaders in the IT service management (ITSM) department. Ninety-three percent of employers said that a good onboarding experience is critical for retention of new employees, according to market share data from Finances Online. The total cost of voluntary turnover in 2020?

Reduce MTTR and Take Automation to a New Level with PagerDuty Global Event Orchestration

PagerDuty’s Global Event Orchestration is now generally available. Global Event Orchestration’s powerful decision engine enriches events, controls their routing, and triggers self-healing actions based on event data. Teams can use this functionality across any or all services within PagerDuty. This feature is a continued investment in Event Orchestration, demonstrating PagerDuty’s commitment to providing customers with best-in-class automation capabilities.

Beyond 6: What's Next for Wi-Fi?

Somewhere lost in the noise of ‘faster’ and ‘better’ marketing messaging around Wi-Fi in the last few years, there were also some genuine transformations in what Wi-Fi is capable of. And what’s on the horizon maybe some of the most significant improvements since the inception of Wi-Fi itself. Let’s look at where Wi-Fi is at, what big changes have happened, and where we’re going next in the land of Wi-Fi technology.

Confidently Manage Multiple Projects with Sentry's New Spend Allocation and Spike Protection

Today we’re announcing our new Spend Allocation feature and updates to Spike Protection, giving you more control over how your projects consume events. While we’ve made it super easy for teams to add Sentry to their projects, we kept hearing from the community that they wanted more guardrails to ensure their noisy projects don’t eat through their event quota.

Proactively track, triage, and assign issues with Datadog Case Management

Complex systems require many different monitors to assess the health of their infrastructure and applications, creating a wealth of alerts that can be hard to track. Due to a lack of effective triage processes, many organizations page engineers for every alert that comes in, making it difficult to separate false positives from issues that actually require immediate attention.

What Is Observability? Everything a Beginner Needs to Know

Observability originated in the field of engineering and has recently gained popularity in the world of software development. Put simply, observability refers to the ability to understand the internal state of a system based on its external outputs. IBM defines it as follows: As systems have become more complex, often including remote elements in cloud-based systems, management of the systems and troubleshooting faults and downtime have also become more complex.

SLO vs SLA vs SLI: A Complete Guide for DevOps Professionals

In today’s fast-paced world of software development, DevOps professionals strive to provide high-quality and dependable services for their users. An essential aspect of achieving this objective is understanding and effectively managing service level indicators (SLIs), service level objectives (SLOs) and service level agreements (SLAs). These metrics help guarantee that a service meets its performance and reliability targets.

How to get Azure Data Factory Pipeline Failure Notification?

Azure Data Factory is a cloud-based data integration tool focusing on data extraction, transformation, and loading. A pipeline in Azure Data Factory is a collection of processes that move data to a shared repository, such as a data warehouse. Why it is important to monitor Azure Data Factory pipeline failures?

Docker monitoring 101: Tools, key features, metrics, and more

Docker is a well-known open-source platform that is predominantly employed to bundle applications and their dependent components into containers for easy development and deployment. Docker is lightweight and efficient in resource consumption by operating as an executable packaged software with all the necessary framework, libraries, code, runtime, and files required to run an application.