
Configuring an OpenTelemetry Collector to connect to BindPlane OP

BindPlane OP is the first open-source, vendor-agnostic agent and pipeline management tool. It makes it easy to deploy, configure, and manage agents on thousands of sources, and to ship metrics, logs, and traces to any destination. This blog shows you how to configure an existing OpenTelemetry Collector from any source to connect to BindPlane OP without needing to remove or reinstall the collector.

What is Kubernetes CrashLoopBackOff? And how to fix it

CrashLoopBackOff is a Kubernetes state representing a restart loop in a Pod: a container in the Pod is started, crashes, and is then restarted, over and over again. Kubernetes waits an increasing back-off time between restarts to give you a chance to fix the error. As such, CrashLoopBackOff is not an error in itself, but indicates that there’s an error preventing a Pod from starting properly.
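The increasing back-off can be sketched in a few lines of Python. This is only an illustration, not how the kubelet is implemented: the function name `backoff_delays` is hypothetical, and the 10-second starting delay, the doubling, and the 5-minute cap are assumptions taken from the defaults described in the Kubernetes documentation, not from the excerpt above.

```python
def backoff_delays(restarts, base=10, cap=300):
    """Return the wait in seconds before each of the first `restarts` restarts.

    Assumed defaults: the delay starts at 10 s, doubles after every crash,
    and is capped at 300 s (5 minutes).
    """
    return [min(base * 2 ** attempt, cap) for attempt in range(restarts)]


if __name__ == "__main__":
    # Delays before the first seven restarts of a crashing container.
    print(backoff_delays(7))  # [10, 20, 40, 80, 160, 300, 300]
```

Under these assumptions the wait stops growing after the sixth restart, which is why a Pod stuck in CrashLoopBackOff appears to retry roughly every five minutes.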

An Introduction to PromQL: How to Write Simple Queries

PromQL is a flexible language designed to make it easy for users to perform ad-hoc queries against their data. By default, Prometheus indexes all of the fields in each metric except for source and target. Prometheus is an open-source tool that lets you monitor Kubernetes clusters and applications. It collects data from monitoring targets by scraping their HTTP metrics endpoints.

New in Grafana Alerting: File provisioning

We are happy to announce that file provisioning for Grafana Alerting has arrived in Grafana 9.1. This feature enables you to configure your whole alerting stack using files on disk, as you may already do with data sources or dashboards. The Terraform Grafana provider has also been updated to allow the provisioning of Grafana Alerting resources.

What are Canary Deployments and Why are they Important?

Every modification to software comes with the potential for production problems. Application failures often have serious consequences which can result in a loss of revenue and a poor customer experience. Additionally, organizations constantly try to improve their services for a better customer experience. How can you minimize the chance of error and update your application with confidence?

Intro to OEE

Efficient manufacturing is important for saving companies time, money, and energy. Making decisions based on data can improve efficiency, but there’s a lot of data to sort through. Manufacturing equipment contains many sensors, especially in the IIoT space. Overall Equipment Effectiveness (OEE) was first described by Seiichi Nakajima in the mid-twentieth century as part of his Total Productive Maintenance (TPM) method.

Ansible Key Terms: Getting Started

If you’re a systems administrator, there’s a good chance you’ve heard of Ansible. But if you’re not familiar with the tool or just getting started with it, there are some key terms and concepts you need to know. Here we will give you an overview of Ansible, from its origins to the latest features. We’ll also cover some of the key terminology associated with Ansible so you can start using it effectively immediately.

ROI Benefits of APM Tools

Software applications have become crucial for business growth and success in today's world. However, as businesses become increasingly competitive, the need to provide top-notch software applications is also increasing. Additionally, as organisations gravitate towards developing extensive, feature-rich applications, they are seeing an increase in software complexity that can quickly cause things to get out of hand.

N-able recognized as Number 1 RMM software in CRN ARC Awards

For the second year running, CRN readers have voted N-able as the Number 1 RMM software in the Annual Report Card (ARC) Awards. The ARC Awards are one of the most prestigious honors in the IT industry, highlighting vendors’ commitment to partner satisfaction and reinforcing their dedication to channel partners through nurturing and growing partner relationships.

incident.io + Indent - on-demand system access

At incident.io, we empower teams to run incidents quickly and effectively from start to finish. One of the ways we help is by taking the manual admin out of your incidents. More often than not, folks are spending too much time thinking about the process, when the time would be better spent focusing on fixing. Our automated workflows, nudges and prompts help to embed best practices and unlock time for more impactful work.