Latest Posts

Does Observability Throw You for a Loop? Part Two: Close with Controllability

In part one, we introduced the duality of observability and controllability. As a reminder, observability is the ability to infer the internal state of a “machine” from externally exposed signals. Controllability is the ability to control inputs to direct the internal state to the desired outcome. So observability is a loop problem, and we need to stop treating it as the end state of our challenge in delivering performant, quality experiences to our users and customers.

Adapting to The New Normal in IT Operations

The waves of change are certainly upon us and businesses are being forced to adapt at a record pace. Current world events have caused a jarring shift in all aspects of our lives, accelerating major changes in how we live and work. An unprecedented number of people are now working from home. Those of us working in IT Operations are no exception. Many companies are implementing a Distributed IT Operations Center (D-NOC) approach to address this new reality.

Closer Look: Intelligent Infrastructure Monitoring

Infrastructure monitoring systems have been undergoing massive change in the past few years. Before the proliferation of cloud computing, IoT and edge computing, mobile apps and SaaS apps, the practice was fairly static, albeit traditionally quite siloed. IT managers needed to get data on a set of standard metrics, from a handful of tools.

Learn snapcraft by example - multi-app client-server snap

Over the past few months, we published a number of articles showing how to snap desktop applications written in different languages – Rust, Java, C/C++, and others. In each one of these zero-to-hero guides, we went through a representative snapcraft.yaml file and highlighted the specific bits and pieces developers need to successfully build a snap. Today, we want to diverge from this journey and focus on the server side of things.

Top 10 Free Tools to Check Your SSL Certificate Installation

If you have installed an SSL certificate, you can use an SSL checker to troubleshoot common SSL errors and vulnerabilities. There is a plethora of tools to choose from, including free ones if you are working on a low budget. Using an SSL checker is simple: you submit the domain name or IP address and the port number, and the tool examines your website’s configuration and security.
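
To make the host-and-port idea concrete, here is a minimal sketch of one such check in Python, using only the standard library. It is not how any particular SSL checker works; the function names `fetch_not_after` and `days_until_expiry` are hypothetical, and the sketch only looks at the certificate's expiry date, which is one small part of what a full checker examines.

```python
import socket
import ssl
from datetime import datetime, timezone


def days_until_expiry(not_after: str, now: datetime) -> int:
    """Whole days between `now` (timezone-aware) and a certificate's notAfter value.

    `not_after` uses the format returned by getpeercert(),
    e.g. "Jan  5 09:34:43 2018 GMT". A negative result means the
    certificate has already expired.
    """
    expires = datetime.fromtimestamp(
        ssl.cert_time_to_seconds(not_after), tz=timezone.utc
    )
    return (expires - now).days


def fetch_not_after(host: str, port: int = 443, timeout: float = 5.0) -> str:
    """Connect to host:port over TLS and return the certificate's notAfter string.

    Hypothetical helper: with the default context, the handshake itself
    raises ssl.SSLCertVerificationError if the chain or hostname is invalid.
    """
    context = ssl.create_default_context()
    with socket.create_connection((host, port), timeout=timeout) as sock:
        with context.wrap_socket(sock, server_hostname=host) as tls:
            return tls.getpeercert()["notAfter"]
```

Calling `days_until_expiry(fetch_not_after("example.com"), datetime.now(timezone.utc))` would report the remaining validity for that host, assuming it is reachable on port 443.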

How to create and monitor redundant means to access network

Remote access to intranets and otherwise restricted areas has always been a challenge. On one hand, it’s desirable to re-create convenient and familiar access to the same resources; on the other hand, there are many issues, starting with ensuring security and access reliability. The same applies to monitoring. If a network’s devices are to be monitored, there must always be access to them; thus, monitoring accessibility becomes a separate task. Below are several guidelines.

Slow and steady

“Remember that guy that gave up? Neither does anybody else.” Progress on the new Monitive has been slow as a snail in a marathon, but the progress was there nevertheless. Since the last update last year, a lot has been going on, and one day I woke up and just decided to write, even if only small updates. Not only because Monitive is about transparency, but because I love writing.

Monitor Apache Flink with Datadog

Apache Flink is an open source framework, written in Java and Scala, for stateful processing of real-time and batch data streams. Flink offers robust libraries and layered APIs for building scalable, event-driven applications for data analytics, data processing, and more. You can run Flink as a standalone cluster or use infrastructure management technologies such as Mesos and Kubernetes.

Incident Response in the time of Remote Work

The unexpected and sudden shift to remote working introduces a new set of problems within the incident response space. While each organization needs to take its own unique circumstances into account, this post outlines best practices and steps in the right direction for keeping operations both productive and proactive.