Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

New SCA-ready Payments System

The Downtime Monkey payments system looks super simple but behind the scenes it's not trivial... This took several months to develop in 2017 and we're proud to say it worked like clockwork. So when we learned that we'd have to redevelop the system to comply with new SCA regulations you can understand that we weren't overcome with enthusiasm...

How Grafana Labs Is Running Jaeger at Scale with Prometheus and Envoy

At Grafana Labs, we are always working on optimizing query time on our Grafana Cloud hosted metrics platform, which incorporates our Metrictank Graphite-compatible metrics service, and Cortex, the open source project for multitenant, horizontally scalable Prometheus-as-a-Service.

Speed and Quality are Not Mutually Exclusive: Telemetry is the Key

All engineering teams strive to build the best product they can as quickly as possible. Some, though, stumble into a false dichotomy of choosing between speed and quality. While that choice may have been necessary in the past, it’s not the case today. What I’d like to do in this article is explain why.

Combating threats with UEBA: Health is wealth

In this four-part series, Combating threats with UEBA, we explore hypothetical cyberattacks inspired by real-life events in four different industries: healthcare, finance, manufacturing, and education. We’ll take a look at unforeseen security attack scenarios, and discover how user and entity behavior analytics (UEBA) can be leveraged to safeguard organizations.

Step by Step guide on how to schedule the snapshot of mulitple VMware or Hyper-V Virtual Machines with auto deletion.

Every so often you need to schedule a snapshot on a virtual machine, whether that be a VMware virtual machine or a Hyper-V virtual machine, or even multiple virtual machines because someone or something is going to be changed and you need a quick way to roll them back to a running state, just in case things go belly up. It’s a very common task that administrators need to do.

Getting Started Using Cycle.io's API With NodeJS

The recent public release of Cycle’s API has already seen all sorts of innovative uses, from automating the deployments of medical applications, to creating customized monitoring services to track specific performance metrics. Everything you can do in the portal can also be accomplished via the API — it’s actually the exact same API we used to build the portal!

Introducing Self-Service Transaction Monitoring-Beta

We’ve released our newest feature, Self-Service Transactions (SST), for the beta release. Although you can still have us take care of the scripting for you (Full-Service Transactions), with SST you write and manage your own scripts. You can use our handy visual Step Editor, write your code directly into the text editor, or interface with our API.

How to collect and manage all of your multi-line logs

Multi-line logs such as stack traces give you lots of very valuable information for debugging and troubleshooting application problems. But, as anyone who has tried knows, it can be a challenge to collect stack traces and other multi-line logs so that you can easily parse, search, and use them to identify problems. This is because, without proper configuration, log management services and tools do not treat multi-line logs as a single event.

Monitoring Hashicorp Vault with LogicMonitor

Hashicorp Vault is an open-source secret management tool that allows organizations to easily "secure, store and tightly control access to tokens, passwords, certificates, encryption keys for protecting secrets and other sensitive data using a UI, CLI, or HTTP API." This solution prevents sensitive information from being stored in unsecured places, and at times stored in plaintext, throughout the organization’s infrastructure.