Operations | Monitoring | ITSM | DevOps | Cloud

Blog

Tutorial: Elasticsearch Snapshot Lifecycle Management (SLM)

Let’s face it, nothing is perfect. The better we architect our systems, though, the more near-perfect they become. But even so, someday, something is likely to go wrong, despite our best effort. Part of preparing for the unexpected is regularly backing up our data to help us recover from eventual failures and this tutorial explains how to use the Elasticsearch Snapshot feature to automatically backup important data.

Solr vs. Elasticsearch: Who's The Leading Open Source Search Engine?

Searches are integral parts of any application. Performing searches on terabytes and petabytes of data can be challenging when speed, performance, and high availability are core requirements. This blog post will pit Solr vs Elasticsearch, two of the most popular open source search engines whose fortunes over the years have gone in different directions. Both of them are built on top of Apache Lucene, so the features they support are very similar.

How Gremlin monitors its own Chaos Engineering service with Datadog

Reliable systems are vital to meeting customer expectations. Downtime not only hurts a company’s bottom line but can be detrimental to reputation. Our goal at Gremlin is to help enterprises build more reliable systems using Chaos Engineering. Whether your infrastructure is deployed on bare metal in a corporate-owned data center or as Kubernetes-orchestrated microservices in a public cloud, chaos experiments can help you find system weaknesses early, before they affect customers.

Sponsored Post

Introducing the ITOM podcast: Listen and learn how to avoid remote work roadblocks in an IT environment

In administrating all technology and application requirements within an organization, IT operations management (ITOM) is pretty complex, and tends to send IT admins scrambling for authentic and actionable insights across the internet. We’re taking matters into our own hands and launching our very own podcast series to provide you valuable information on ITOM, which you can choose to listen to at your leisure or on the go!

Autoscaling Puppet compile masters with AWS

In classic Puppet deployment architecture, compile masters are widely used when the number of managed nodes goes up. Multiple compile masters sit behind a load balancer to take care of the additional workloads. It is not rare to see Puppet adopters launching the compile masters in the public cloud, such as Amazon Web Service (AWS) and Google Cloud Platform.

What is Azure VM Insights?

Microsoft recently announced general availability of Azure VM Insights, aka Azure Monitor for VM. This service is basically a set of features that allow you to monitor your VMs in more detail, from collecting the telemetry from your VM to displaying it meaningfully – all with a single click. I am satisfied with Azure VM Insights for the most part, but I also have some mixed feelings about it. Read on to find out why.

Freshservice launches Return-to-work app to help workplaces reopen safely

As companies prepare for the workforce to return to work, a myriad of decisions must be made. Return to work policies must encompass a full understanding of the needs and questions of its employees. The other top need is a true assessment of safety as it relates to COVID-19. How does the new place differ from one that is familiar? From the perspective of the employee, there may be sentiments of fear, uncertainty, and reluctance.

ITSM Change Management to Control Continuous Cost Optimization

I’ve been writing about continuous cloud optimization for a while now, and recently, I’ve spoken with several organizations to understand any challenges they’re currently facing in their automation journey. Their insights would help us understand how we can improve our technology to better support them. I discovered two fundamental themes behind their challenges.