
Using Automation to Transform IT From Cost Center to Value Driver

In today’s digital age, IT has become the central component of business operations. Yet, despite its critical importance, skilled technicians continue to find their hands tied by time-consuming manual tasks. And, while many of these tasks are essential, they do virtually nothing to drive innovation. Introducing automation into the mix can free up IT talent to focus their skills on more important business initiatives – particularly those that drive change and generate revenue.

An Introduction to Kubernetes Observability

If your organization is embracing cloud-native practices, then breaking systems into smaller components or services and moving those services to containers is an essential step in that journey. Containers allow you to take advantage of cloud-hosted distributed infrastructure, move and replicate services as required to ensure your application can meet demand, and take instances offline when they’re no longer needed to save costs.

How Gremlin's reliability score works

To make reliability improvements tangible, you need a meaningful way to quantify and track the reliability of systems and services. This "reliability score" should indicate at a glance how likely a service is to withstand real-world causes of failure without having to wait for an incident to happen first. Gremlin's upcoming feature allows you to do just that.

Monitor your T2A-powered GKE workloads with Datadog

Arm processors have become increasingly popular in recent years, providing energy-efficient, cost-effective processing power to both mobile and cloud computing ecosystems. As a part of this growth, more and more organizations are choosing to leverage the many benefits of Arm-based architectures for their containerized workloads. Today, Google Cloud announced its Arm-based Tau T2A virtual machines (VMs), which you can also use to run workloads in Google Kubernetes Engine (GKE).

The Role of Middleware in Distributed Systems

In distributed systems, middleware is a software component that sits between two or more applications and provides services that each of them can use. In this article, we will look at the role of middleware in distributed systems.

We've raised $34M to help organisations be resilient in the face of failure

TL;DR: We’ve raised $34M to bring increased resilience to organisations around the world. With this latest round of investment we’re expanding internationally in the US, accelerating our product plans, and growing our amazing team 🎉 As technology becomes more complicated and runs an ever greater part of our lives, failure becomes more inevitable, and more costly.

The Leading Tools Compatible With OpenTelemetry

OpenTelemetry (also known as OTel) is a popular open-source framework used to generate telemetry data for traces, metrics, events and logs. In this guide, we are going to cover the best observability and application performance management tools that can be used alongside OpenTelemetry to transform telemetry data into responsive reporting dashboards.

Lars Rossen on What to Expect From IT4IT 3.0

IT4IT was created as a framework for IT service management, and has established itself as an alternative (or perhaps complementary?) standard to the widely acclaimed ITIL. But since it's been around for a decade now, it's about to change. Lars Rossen, one of the creators of the first version of the IT4IT Reference Architecture, which formed the basis for the standard, told us first-hand what to expect from IT4IT 3.0 on Episode 9 of the Ticket Volume podcast.

What is a Neural Network (and How Does it Train Itself)?

You’ve probably heard neural networks hailed as the next big step in artificial intelligence (AI). Beyond their often exaggerated depiction in fiction and media, neural networks have slowly but steadily become an invaluable asset in the IT world. They remain an active area of research in data science and computer science.

What is QoS?

Quality of Service (QoS) is a set of methods and technologies for controlling network traffic so that critical applications keep performing well when network capacity is limited. It enables organizations to shape their overall network traffic by prioritizing specific high-performance applications. Your internet connection is like a highway where different types of vehicles travel to reach their destination. Car drivers, truckers, average commuters, and emergency services vehicles all share the same lanes.
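
To make the highway analogy concrete, here is a minimal sketch of strict-priority scheduling, which is only one of several ways QoS can be implemented. The traffic class names, their ranking, and the `PriorityScheduler` class are assumptions chosen for illustration; they do not come from any particular product or standard.

```python
import heapq
from itertools import count

# Hypothetical traffic classes (illustrative only); a lower number means higher priority.
PRIORITY = {"emergency": 0, "voice": 1, "web": 2, "bulk": 3}


class PriorityScheduler:
    """Strict-priority queue: the highest-priority packet always leaves first."""

    def __init__(self):
        self._heap = []
        self._seq = count()  # tie-breaker that keeps FIFO order within a class

    def enqueue(self, traffic_class, packet):
        heapq.heappush(
            self._heap, (PRIORITY[traffic_class], next(self._seq), packet)
        )

    def dequeue(self):
        # Pop the entry with the smallest (priority, sequence) pair.
        return heapq.heappop(self._heap)[2]

    def __len__(self):
        return len(self._heap)


def drain(scheduler):
    """Send every queued packet and return them in transmission order."""
    sent = []
    while len(scheduler):
        sent.append(scheduler.dequeue())
    return sent


sched = PriorityScheduler()
sched.enqueue("bulk", "backup-chunk-1")
sched.enqueue("web", "page-request")
sched.enqueue("emergency", "alarm-signal")
sched.enqueue("voice", "call-frame-1")
sched.enqueue("voice", "call-frame-2")

order = drain(sched)
print(order)
```

Although the backup chunk arrived first, it is sent last, much as an ambulance is waved past ordinary traffic. Strict priority can starve low-priority traffic under sustained load, so real deployments often combine it with weighted or rate-limited queues.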