Operations | Monitoring | ITSM | DevOps | Cloud

APImetrics + Squadcast: Routing Alerts Made Easy

APImetrics is an API Compliance, Monitoring and Security solution that lets you make and run API calls or sequences of API calls (workflows) from external, remote cloud locations using exactly the same security configurations as a typical end user would use. If you use APImetrics for API calling requirements, you can integrate it with Squadcast, an end-to-end incident response tool, to route detailed alerts from APImetrics to the right users in Squadcast.

Goodbye LAN. The Internet is the Network.

We are at the cusp of an important technology transformation. A discontinuity in technology as Peter Drucker would call it (precipitated by Covid). For decades, IT organizations invested in building, managing, and monitoring LANs. Everything was on your local network: your CRM, your Exchange email, the file shares, and the print server. Today, many companies are shutting down their “old legacy network” and are running their enterprise without a LAN, WAN, or an OnPrem datacenter.

Understanding Your Amazon EKS Spend

Most customers running Kubernetes clusters Amazon EKS are regularly looking for ways to better understand and control their costs. While EKS simplifies Kubernetes operations tasks, customers also want to understand the cost drivers for containerized applications running on EKS and best practices for controlling costs. Anodot has collaborated with Amazon Web Services (AWS) to address these needs and share best practices on optimizing Amazon EKS costs.

What is MLOps?

MLOps is the short term for machine learning operations and it represents a set of practices that aim to simplify workflow processes and automate machine learning and deep learning deployments. It accomplishes the deployment and maintenance of models reliably and efficiently for production, at a large scale. MLOps is slowly evolving into an independent approach to the machine learning lifecycle that includes all steps – from data gathering to governance and monitoring.

Track maintainers and purpose for hosts in your infrastructure

When something goes wrong or looks fishy for a particular host in your infrastructure how do you know who to ask about it? In an infrastructure managed by many and used by many it is also helpful to know what each hosts’ purpose is. In this article we show how to add maintainer and purpose information to individual hosts in your infrastructure via the CMDB feature of Mission Portal. We will also add a Build Module to add this information to the /etc/motd file for each associated host.

When to Use K3s and RKE2

K3s and Rancher Kubernetes Engine (RKE2) are two Kubernetes distributions from the SUSE Rancher container platform. Either project can be used to run a production-ready cluster; however, they target different use cases and consequently possess unique characteristics. This article will explain the similarities and differences between the projects. You’ll learn when it makes sense to use RKE2 instead of K3s and vice versa.

The Immutability of Time Series Data

Time series data often comes in large volumes that need to be handled carefully to produce insights in near real time. We’re constantly moving through time. The time it took you to read this sentence is now forever in the past, unchangeable. This leads to something unique about data with a time dimension: It can only go in one direction. Time series data is different from other data for many reasons.

Monitoring and Observability in Azure Services

In the software industry, monitoring and observability come hand in hand and are referenced simultaneously, but both are not the same thing. Having the right tools for these capabilities is vital to keep your systems, data, and security perimeter safe. With cloud solutions topping up in the integration space, great tools have been available in the market, and choosing the right one would be challenging.

Explaining Load Average in Linux for System Administrators & Developers

When you run the uptime command, most of you might be familiar with the three numbers appearing on the top right corner of your Linux screen. But, do you know what those numbers indicate or why there are three such numbers? It is called the load average, a metric that assesses the load on your computer systems. While it can be considered a precise tool for measuring system and resource engagement, it would only be worthwhile if you understand it right.

Observability Pipelines for an SRE

In data management, numerous roles rely on and regularly use observability data. The Site Reliability Engineer is one of these roles. Site Reliability Engineers (SREs) work on the digital frontlines, ensuring performant experiences by using observability data to maintain stability and awareness of software running in various environments across organizations.