Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Predictive Maintenance: A Brief Introduction

Predictive maintenance is a maintenance strategy that uses machine learning algorithms trained with Industrial Internet of Things (IIoT) data to make predictions about future outcomes, such as determining the likelihood of equipment and machinery breaking down. Using a combination of data, statistics, machine learning and modeling, predictive maintenance is able to optimize when and how to execute maintenance on industrial machine assets.

An Overview of the Essential Observability Metrics

Metrics are closely associated with cloud infrastructure monitoring or application performance monitoring – we monitor metrics like infrastructure CPU and request latency to understand how our services are responding to changes in the system, which is a good way to surface new production issues. As many teams transition to observability, collecting metric data isn’t enough.

Migrating 1 billion log lines from OpenSearch to Elasticsearch

What are the current options to migrate from OpenSearch to Elasticsearch®? OpenSearch is a fork of Elasticsearch 7.10 that has diverged quite a bit from itself lately, resulting in a different set of features and also different performance, as this benchmark shows (hint: it’s currently much slower than Elasticsearch).

Is a $1 million Datadog bill worth it?

In a recent reddit thread, I got into a conversation about justifying the cost of observability. It got to a really basic question about running a tech company: how do you know that any cost is justified? While a small number of expenses have clear and direct business values, a bunch of other costs, I would even say most costs, just aren’t that clear cut.

Elasticsearch and Arduino: Better together!

An easy way to communicate with Elasticsearch and Elastic Cloud using Arduino IoT devices At Elastic®, we are constantly looking for new ways to simplify search experience, and we started to look at the IoT world. The collection of data coming from IoT can be quite challenging, especially when we have thousands of devices. Elasticsearch® can be very useful to collect, explore, visualize, and discover data — for all the data coming from multiple devices.

Container Orchestration: A Beginner's Guide

Container orchestration is the process of managing containers using automation. It allows organizations to automatically deploy, manage, scale and network containers and hosts, freeing engineers from having to complete these processes manually. As software development has evolved from monolithic applications, containers have become the choice for developing new applications and migrating old ones.

CapEx vs OpEx for Cloud, IT Spending, & More

Capital expenditures (CapEx) and operational expenditures (OpEx) are two ways organizations categorize their business expenses. Every organization has a variety of expenses, from office rent to IT infrastructure costs to wages for their employees. To simplify accounting, they organize these costs into different categories, two of the most common being CapEx and OpEx.

Ingesting and analyzing Prometheus metrics with Elastic Observability

In the world of monitoring and observability, Prometheus has grown into the de-facto standard for monitoring in cloud-native environments because of its robust data collection mechanism, flexible querying capabilities, and integration with other tools for rich dashboarding and visualization.

systemd journal logs: A Game-Changer for DevOps and Developers

“Why bother with it? I let it run in the background and focus on more important DevOps work.”— a random DevOps Engineer at Reddit r/devops In an era where technology is evolving at breakneck speeds, it's easy to overlook the tools that are right under our noses. One such underutilized powerhouse is the systemd journal. For many, it's a mere tool to check the status of systemd service units or to tail the most recent events (journalctl -f).

Centralized Logging & Centralized Log Management (CLM)

Centralized logging provides visibility into the system by consolidating all the log data in a single all-in-one source. It supports two particular enterprise needs: Once all the data is ingested in a central location, you can seamlessly identify the problems in systems and troubleshoot them. But with ease comes challenges, too. For example, your team members may struggle with locating their desired details from this sea of data.