Operations | Monitoring | ITSM | DevOps | Cloud

Monitoring Your Platform From Multiple Locations

Mature start-ups and scale-ups create wonderful and challenging environments for Engineers. As the product they’re creating matures and the brand becomes a successful one, the user base generally starts growing, and, for some companies, in places they might not expect it to grow. As that happens, new challenges arise for Engineers. One of these challenges is pretty straightforward to guess. Basically having a particular product available throughout different regions of the world.

Kubernetes 101: How To Set Up "Vanilla" Kubernetes

Kubernetes is an open source platform that, through a central API server, allows controllers to watch and adjust what’s going on. The server interacts with all the nodes to do basic tasks like start containers and pass along specific configuration items such as the URI to the persistent storage that the container requires. But Kubernetes can quickly get complicated. So, let’s look at Vanilla Kubernetes — the nickname for a a K8s setup that’s as basic and elementary as it gets.

Using Automation to Transform IT From Cost Center to Value Driver

In today’s digital age, IT has become the central component of business operations. Yet, despite its critical importance, skilled technicians continue to find their hands tied by time-consuming manual tasks. And, while many of these tasks are essential, they do virtually nothing to drive innovation. Introducing automation into the mix can free up IT talent to focus their skills on more important business initiatives – particularly those that drive change and generate revenue.

An Introduction to Kubernetes Observability

If your organization is embracing cloud-native practices, then breaking systems into smaller components or services and moving those services to containers is an essential step in that journey. Containers allow you to take advantage of cloud-hosted distributed infrastructure, move and replicate services as required to ensure your application can meet demand, and take instances offline when they’re no longer needed to save costs.

How Gremlin's reliability score works

In order to make reliability improvements tangible, there needs to be a way to quantify and track the reliability of systems and services in a meaningful way. This "reliability score" should indicate at a glance how likely a service is to withstand real-world causes of failure without having to wait for an incident to happen first. Gremlin's upcoming feature allows you to do just that.

Monitor your T2A-powered GKE workloads with Datadog

Arm processors have become increasingly popular in recent years, providing energy-efficient, cost-effective processing power to both mobile and cloud computing ecosystems. As a part of this growth, more and more organizations are choosing to leverage the many benefits of Arm-based architectures for their containerized workloads. Today, Google Cloud announced its Arm-based Tau T2A virtual machines (VMs), which you can also use to run workloads in Google Kubernetes Engine (GKE).

The Role of Middleware in Distributed Systems

In distributed systems, middleware is a software component that provides services between two or more applications and can be used by them. Middleware can be thought of as an application that sits between two separate applications and provides service to both. In this article, we will see a role of middleware in distributed systems.

The Leading Tools Compatible With OpenTelemetry

OpenTelemetry (also known as OTel) is a popular open-source framework used to generate telemetry data for traces, metrics, events and logs. In this guide, we are going to cover the best observability and application performance management tools that can be used alongside OpenTelemetry to transform telemetry data into responsive reporting dashboards.

Lars Rossen on What to Expect From IT4IT 3.0

IT4IT was created as a framework for IT service management, and has established itself as an alternative — or perhaps complementary? — standard to the widely acclaimed ITIL. But since it's been around for a decade now, it's about to change. Lars Rossen — one of the creators of the first version of the IT4IT Reference Architecture, which formed the basis for the standard — told us first-hand what to expect from IT4IT 3.0 on Episode 9 of Ticket Volume podcast.

What is a Neural Network (and How Does it Train Itself)?

You’ve probably heard about neural networks being hailed as the next big step in technological advancements in artificial intelligence (AI). Beyond its often exaggerated depiction in fiction and media, neural networks have slowly but steadily become an invaluable asset in the IT world. It is under constant research in data science and computer science.