Operations | Monitoring | ITSM | DevOps | Cloud

Configuring an OpenTelemetry Collector to connect to BindPlane OP

Bindplane OP is the first open source, vendor-agnostic, agent and pipeline management tool. It makes it easy to deploy, configure, and manage agents on thousands of sources, and ship metrics, logs, and traces to any destination. This blog shows you how to configure an existing OpenTelemetry Collector from any source to connect to Bindplane OP without needing to remove or reinstall the collector.

What is Kubernetes CrashLoopBackOff? And how to fix it

CrashLoopBackOff is a Kubernetes state representing a restart loop that is happening in a Pod: a container in the Pod is started, but crashes and is then restarted, over and over again. Kubernetes will wait an increasing back-off time between restarts to give you a chance to fix the error. As such, CrashLoopBackOff is not an error on itself, but indicates that there’s an error happening that prevents a Pod from starting properly.

An Introduction to PromQL: How to Write Simple Queries

PromQL is a flexible language designed to make it easy for users to perform ad-hoc queries against their data. By default, Prometheus indexes all of the fields in each metric except for source and target, which are not indexed by default. Prometheus is an open-source tool that lets you monitor Kubernetes clusters and applications. It collects data from monitoring targets by scraping metrics HTTP endpoints.

Using StatusPage at squadcast | SRE Best practices | Squadcast

Let your customers know how your Services are doing, without them having to ask you about it. One of the core principles of SRE is Transparency and Status Pages help you communicate the status of your Services to your customers at all times, as opposed to you getting to know the status of your Services through support tickets logged by your customers.

New in Grafana Alerting: File provisioning

We are happy to announce that file provisioning for Grafana Alerting has arrived in Grafana 9.1. This feature enables you to configure your whole alerting stack using files on disk, as you may already do with data sources or dashboards. The Terraform Grafana provider has also been updated to allow the provisioning of Grafana Alerting resources.

What are Canary Deployments and Why are they Important?

Every modification to software comes with the potential for production problems. Application failures often have serious consequences which can result in a loss of revenue and a poor customer experience. Additionally, organizations constantly try to improve their services for a better customer experience. How can you minimize the chance of error and update your application with confidence?

Intro to OEE

Efficient manufacturing is important for saving companies time, money, and energy. Making decisions based on data can improve efficiency, but there’s a lot of data to sort through. Manufacturing equipment contains many sensors, especially in the IIoT space. Overall Equipment Effectiveness (OEE) was first described by Seiichi Nakajima in the mid-twentieth century as part of his Total Productive Maintenance (TPM) method.

Ansible Key Terms: Getting Started

If you’re a systems administrator, there’s a good chance you’ve heard of Ansible. But if you’re not familiar with the tool or just getting started with it, there are some key terms and concepts you need to know. Here we will give you an overview of Ansible, from its origins to the latest features. We’ll also cover some of the key terminology associated with Ansible so you can start using it effectively immediately.

ROI Benefits of APM Tools

Software applications have become crucial for business growth and success in today's world. However, as businesses become increasingly competitive, the necessity to provide top-notch software applications is also increasing. Additionally, as organisations gravitate towards developing extensive, feature-rich applications, they are witnessing an increase in software complexity – that can often cause things to get out of hand very quickly.