Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Detecting and responding to Cloud Logging events in real-time

Logging is a critical component of your cloud infrastructure and provides valuable insight into the performance of your systems and applications. On Google Cloud, Cloud Logging is a service that allows you to store, search, monitor, and alert on log data and events from your Google Cloud Platform (GCP) infrastructure services and your applications. You can view and analyze log data in real time via Logs Viewer, command line or Cloud SDK.

Troubleshooting Encoding Errors in Ruby

Text encoding is fundamental to programming. Web sites, user data, and even the code we write are all text. When encoding breaks, it can feel like the floor is falling out from under you. You're cast into a dimension of bitmasks and codepoints. Logs and backtraces are useless. You consider trading your text editor for a hex editor. But there's hope! In this article, Jose Manuél will show us how encoding errors happen, how they're expressed in Ruby, and how to troubleshoot them.

Process Automation for Modern IT Operations Management

Automating repetitive tasks and processes has always been a top priority for business operations and enterprise IT teams. As we experience accelerated digital transformation in this time of remote work, process automation has become essential. In this Tech Talk for IT operations management, we'll focus on the key process automation trends and challenges and demonstrate OpsRamp's remediation and automation capabilities.

The Evolution of Open Source Observability

On May 27, the first OpenObservability Conference was held to bring together leaders, practitioners, and users of leading open source observability tools for sessions on the experiences, strategies, and future of the industry. For the Logz.io team, as long-time proponents of open source, it was rewarding to see everyone come together to explore the challenges and opportunities of open source observability.

Catchpoint's SRE Report 2020 - The Highlights

Our 2020 SRE Report is ready! We launched the SRE survey 2020 this January with the goal of understanding the current state of SRE. The survey covered a range of topics including: As we neared the end of the survey period, the SRE community was in the midst of a sudden change. SRE teams were forced to migrate to all-remote IT. We realized we would not be able to provide an accurate analysis without considering this shift in how SRE teams were operating in this new environment.

Using Command Endpoint to execute Checks on Agents

Getting started using Icinga in a single node setup can already be quite a challenge for new users. Configuring checks on multiple nodes can seem at lot harder. With this blog post I will show you how easy it is to configure hosts and services in such an environment. In this example our cluster will consist of one master and two agents and we will use the command endpoint method to execute checks on our agents.