The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Getting started with PromQL

This article focuses on the popular monitoring tool Prometheus and how to query it with PromQL. Prometheus is written in Go and can monitor many services and systems simultaneously. To support monitoring of these multi-component systems, Prometheus has strong built-in data storage and labeling (tagging) functionality. To query these metrics with PromQL, you need to understand how Prometheus stores data and how metric naming and labels work.
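As a rough illustration of how a metric name and its labels combine into a PromQL query, here is a minimal Python sketch that builds selector strings. The metric name `http_requests_total`, the labels `job` and `status`, and the helper names `selector` and `rate_over` are assumptions chosen for illustration; they do not come from the article.

```python
def selector(metric, labels=None):
    """Build a PromQL selector string such as metric{name="value"}."""
    if not labels:
        return metric
    parts = []
    # Sort label names so the generated query is deterministic.
    for name, value in sorted(labels.items()):
        # Escape backslashes and double quotes inside the label value.
        escaped = value.replace("\\", "\\\\").replace('"', '\\"')
        parts.append(name + '="' + escaped + '"')
    return metric + "{" + ",".join(parts) + "}"


def rate_over(sel, window):
    """Wrap a selector in rate() over a range window such as 5m."""
    return "rate(" + sel + "[" + window + "])"


if __name__ == "__main__":
    # Hypothetical metric and label values, for illustration only.
    sel = selector("http_requests_total", {"job": "api", "status": "500"})
    print(sel)
    print(rate_over(sel, "5m"))
```

The resulting strings can be pasted into the Prometheus expression browser; the labels inside the braces narrow the query to the matching time series.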

Top 5 Server Monitoring Tools

The need to monitor the health of servers and networks is universal. You don't want to be a blind pilot headed for an inevitable disaster. Fortunately, there are many open source and commercial tools to help with monitoring. As always, good and expensive is not as attractive as good and cheap. So, we've put together the most valuable cloud and Windows monitoring tools to get you started.

Day 2 Navigate Europe 2023 Wrap Up

We kicked off Day 2 with our host Nigel Poulton, who gave a quick rundown of the highlights from the first day and a taste of what to expect from the rest of the event. Nigel then brought Kelsey Hightower to the stage for his keynote session with Mark Boost and Dinesh Majrekar. If you missed our Day 1 recap, check it out here.

Deploy fully configured VMs in minutes on Google Cloud, using gcloud CLI and cloud-init

Every public cloud provider has a templating mechanism to deploy fully configured applications. For anyone interested in a vendor-neutral approach that works on the major public clouds, cloud-init offers a good solution. Cloud-init makes your work reusable regardless of the clouds you deploy to. It’s an open source configuration automation solution for Linux that performs steps at various stages of a single machine’s boot process.
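To make the idea concrete, the following Python sketch generates a small cloud-config document and the argument list for a `gcloud` call that passes it to a new VM as `user-data` metadata. The instance name, zone, file name, package, and image family are illustrative assumptions, as are the helper names `cloud_config` and `gcloud_create_command`; the article's own setup may differ.

```python
import json
import shlex


def cloud_config(packages, commands):
    """Render a minimal cloud-config document as text."""
    lines = ["#cloud-config", "package_update: true", "packages:"]
    # json.dumps yields double-quoted strings, which are valid YAML scalars.
    lines += ["  - " + json.dumps(p) for p in packages]
    lines.append("runcmd:")
    lines += ["  - " + json.dumps(c) for c in commands]
    return "\n".join(lines) + "\n"


def gcloud_create_command(name, zone, user_data_path):
    """Build the gcloud argument list that boots a VM with the given user-data."""
    return [
        "gcloud", "compute", "instances", "create", name,
        "--zone=" + zone,
        # Assumed image: an Ubuntu LTS family that ships with cloud-init.
        "--image-family=ubuntu-2204-lts",
        "--image-project=ubuntu-os-cloud",
        "--metadata-from-file=user-data=" + user_data_path,
    ]


if __name__ == "__main__":
    # Hypothetical values, for illustration only.
    with open("user-data.yaml", "w") as fh:
        fh.write(cloud_config(["nginx"], ["systemctl enable --now nginx"]))
    print(shlex.join(gcloud_create_command("demo-vm", "europe-west1-b", "user-data.yaml")))
```

The script only prints the command rather than running it, so it can be reviewed before anything is created in a project.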

Multi-cluster Failover With A Service Mesh - Civo Navigate NA 2023

Dive deep into the world of Linkerd with Jason from Buoyant. Explore the power of service meshes in Kubernetes, understand the nuances of traffic management, and witness a seamless live failover demonstration. Discover how Linkerd enhances security, observability, and reliability in your applications.

Take Your Pick! The Best Server Monitoring Tools on the Market

IT professionals are always presented with myriad solutions when seeking additional software for their network infrastructure, and server monitoring is no exception. After all, every organization has its own needs, infrastructure and software requirements. With that in mind, the following list is a guide to help IT professionals select the best server monitoring solution for their organization.

Hyperview Integrates Digitalor for Rack-Unit RFID Asset Tracking and Environmental Sensors

Vancouver, BC—September 13, 2023— Hyperview, a leading cloud-based data center infrastructure management (DCIM) platform provider, and Digitalor, a global leader in rack-unit MC-RFID asset tracking, have announced a strategic partnership that offers Hyperview users automated, real-time life cycle management for data centers and hybrid IT environments.

Deploying Single Node And Clustered RabbitMQ

RabbitMQ is a messaging broker that helps different parts of a software application communicate with each other. Think of it as a middleman that takes care of sending and receiving messages so that everything runs smoothly. Since its release in 2007, it's gained a lot of traction for being reliable and easy to scale. It's a solid choice if you're dealing with complex systems and want to make sure data gets where it needs to go.

Our first ML based anomaly alert

Over the last few years we have slowly and methodically been building out the ML-based capabilities of the Netdata agent, dogfooding and iterating as we go. To date, these features have been mostly reactive: tools that help once you are already troubleshooting. Now we feel we are ready to take a first gentle step into some more proactive use cases, starting with a simple node-level anomaly rate alert. Note: you can read a bit more about our ML journey in our ML-related blog posts.

Unlocking IT: Considerations for a Powerful Observability Strategy

In today's cloud-native landscapes, observability is more than a buzzword; it's a critical element for software development teams looking to master the complexities of modern environments like Kubernetes. Observability is multi-faceted, with levels and dimensions ranging from basic metrics to comprehensive business insights. It’s complex, and the work can continue indefinitely if you let it.