Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Code Refactoring and why you should refactor your code

Software does not expire, but it “rots”: its quality degrades over time. As you build your project and add features, you probably won’t always build it in a clean, orderly and mindful way, especially if you have a tight deadline. So aside from features, you also produce bugs, code smells, and technical debt. These “rot” your software, but your job as a software engineer is to maintain its “freshness” while building on top of it.

Monitoring with Graphite: Architecture and Concepts

In this article, we provide a concise guide to help you get started with Graphite quickly and efficiently. We cover the basic concepts, architectural considerations, and metrics aggregation of Graphite. We also explain the data feeding methods, metrics format, and storage using Graphite's file-based database. Additionally, we discuss visualization options, including Graphite-Web and Grafana.
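As a rough illustration of the metrics format and one of the data feeding methods mentioned above, here is a minimal sketch of Carbon's plaintext protocol, in which each metric is a single line of the form `<metric.path> <value> <timestamp>`. The metric path, the host, and the helper names `format_metric` and `send_metric` are assumptions chosen for illustration; port 2003 is Carbon's default plaintext listener port, but your deployment may differ.

```python
import socket
import time


def format_metric(path: str, value: float, timestamp: int) -> str:
    """Build one plaintext-protocol line: path, value, and Unix timestamp."""
    # Fields are separated by single spaces and the line ends with a newline.
    return f"{path} {value} {timestamp}\n"


def send_metric(path, value, host="localhost", port=2003, timestamp=None):
    """Send one metric to a Carbon plaintext listener (hypothetical host/port)."""
    if timestamp is None:
        timestamp = int(time.time())  # Graphite expects seconds since the epoch
    line = format_metric(path, value, timestamp)
    # Open a short-lived TCP connection and write the single metric line.
    with socket.create_connection((host, port), timeout=5) as sock:
        sock.sendall(line.encode("ascii"))
    return line


if __name__ == "__main__":
    # Only formats the line; call send_metric(...) when a Carbon daemon is reachable.
    print(format_metric("servers.web01.cpu.load", 0.75, 1700000000), end="")
```

The dotted metric path maps to a hierarchy in Graphite's storage, so a naming scheme such as `servers.<host>.<subsystem>.<metric>` keeps related series grouped together.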

The lowdown on Loki for log aggregation: 5 demos you don't want to miss

Looking to get started with log aggregation? Or perhaps take your logging game to a whole new, more advanced level? You’ve come to the right place. Grafana Loki is a key component of Grafana Labs’ open and composable Grafana LGTM stack (Loki for logs, Grafana for visualization, Tempo for traces, Mimir for metrics).

Grafana k6 v0.45.0 release: gRPC streaming support, cloud script updates without running tests and more!

Grafana k6 v0.45.0 has been released, featuring a new experimental module for gRPC streaming support, a new browser recorder extension for Firefox and Chrome, and tons of improvements for Grafana k6 OSS and Grafana Cloud k6. Here’s a quick overview of the latest k6 release and all the news from the community.

Azure Incident Management with Escalation Policy

These days, businesses rely heavily on cloud services like Microsoft Azure to power their operations. While Azure provides robust infrastructure and services, occasional issues and incidents can still occur. Serverless360 provides enhanced capabilities to monitor and manage Azure incidents in a system. But to ensure seamless operations and timely resolution of problems, it is crucial to have a well-defined escalation policy in place for Azure Incident Management.

The Curious Case Of Kubernetes Health Checks

Health checks for cloud infrastructure refer to the mechanisms and processes used to monitor the health and availability of the components within a cloud-based system. These checks are essential for ensuring that the infrastructure is functioning correctly and that any issues or failures are detected and addressed promptly. Health checks typically involve monitoring various parameters such as system resources, network connectivity, and application-specific metrics.
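To make the idea concrete, the following is a minimal sketch of how several individual checks might be aggregated into a single health status, the kind of result an HTTP probe endpoint could return. It is not taken from the article: the function name `run_health_checks`, the check names, and the choice of 200 for healthy and 503 for unhealthy are illustrative assumptions.

```python
import shutil
from typing import Callable, Dict, Tuple


def run_health_checks(checks: Dict[str, Callable[[], bool]]) -> Tuple[int, Dict[str, str]]:
    """Run each named check and return (HTTP-style status code, per-check results)."""
    results = {}
    for name, check in checks.items():
        try:
            ok = bool(check())
        except Exception:
            # A check that raises is treated as failed rather than crashing the probe.
            ok = False
        results[name] = "ok" if ok else "fail"
    # Healthy only if every check passed; 503 signals "service unavailable".
    status = 200 if all(r == "ok" for r in results.values()) else 503
    return status, results


def disk_has_space(path: str = "/", min_free_ratio: float = 0.05) -> bool:
    """Hypothetical system-resource check: at least 5% of the disk is free."""
    usage = shutil.disk_usage(path)
    return usage.free / usage.total >= min_free_ratio


if __name__ == "__main__":
    status, results = run_health_checks({"disk": disk_has_space})
    print(status, results)
```

Keeping each check as a small callable makes it straightforward to add network connectivity or application-specific checks later without changing the aggregation logic.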

How to Trial Honeycomb and OpenTelemetry

Insightful proof-of-concepts with a tool can be difficult to undertake due to the demands on valuable resources: time, energy, and people. With a task as grand as observability, how could one truly test if Honeycomb and OpenTelemetry are right for their organization and meet their requirements? For this thought experiment, here’s a comprehensive description of the ideal product evaluation over the course of four weeks, given unlimited resources.

Should I Stay or Should I Go? Get smarter about your refresh cycles

Deciding what to migrate, what to modernize, and what to retain on-premises is part of enterprise IT infrastructure management. When a refresh cycle is up in your data center, there are two very different types of competing motions you need to evaluate. While they may appear to be independent, they’re also kind of not, so it can be tricky to decide which one to execute—or even to execute both—and to do so smoothly.

Expand Your Monitoring Capabilities with AppSignal's Standalone Agent Docker Image

Want to monitor all of your application's services? Our Standalone Agent allows you to monitor processes our standard integrations don't monitor by default, helping you effortlessly expand your monitoring capabilities. To help simplify the process of configuring our standalone agent, we're excited to announce the launch of our Standalone Agent's Docker image, available on Docker Hub under the name appsignal/agent.

InfluxDB 3.0: System Architecture

InfluxDB 3.0 (previously known as InfluxDB IOx) is a scalable cloud database that offers high performance for both data loading and querying, and focuses on time series use cases. This article describes the system architecture of the database. Figure 1 shows the architecture of InfluxDB 3.0, which includes four major components and two main storage systems.