Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

Exception Perceptions: Attack of the Cloned Issues (for Better Understanding User Behavior)

We’re back! That’s right, we have a new episode of Exception Perceptions to share with y’all. This is part 2 of of our Star Wars series, and this time we’re talking all about using errors to better understand user behavior.

What is MTTR? How to measure and improve your Mean Time to Recovery

Complex distributed systems run just about every service imaginable. Healthcare systems that monitor patient health, security systems, and financial systems are all mission-critical. Downtime, or lack of availability, loses money and can even put lives at risk. These systems must be monitored. Many measurements are useful to keep systems running with as little downtime as possible. One of those is Mean Time To Recovery. (MTTR.)

PagerDuty Launches Event Intelligence to Extend Its Market-Leading Digital Operations Management Platform

SAN FRANCISCO – June 7, 2018 – PagerDuty today announced PagerDuty Event Intelligence, a new product that builds on its market-leading digital operations management platform. Event Intelligence analyzes both incoming digital signals and human response patterns (such as when and how alerts are resolved by responders) and uses automation to identify issues quickly, enabling teams to take action on critical software issues and opportunities.

Instrument Your Ruby App In Minutes with the Honeycomb Beeline for Ruby

Meet the Honeycomb Beeline for Ruby. Like our Beelines for Go and Node, it understands the common packages you’re using and automatically instruments them to send useful events to Honeycomb. Then once you’ve got a chance to explore your app’s behavior, you can add custom fields specific to your app with just one line of code.

Adopting a SIEM solution, Part 1: Why choose SIEM?

Many enterprises are at the brink of digital transformation, which entails adopting new technologies that process a sea of both personal and enterprise-level data. Despite a surging number of innovations to prevent evolving cyber threats from hijacking that data, the sheer number of successful, high-profile data breaches and attacks recently highlight the insufficient security practices of organizations around the globe.

The Importance of Application Decomposition in App Performance Testing

DevOps is changing the way companies develop and maintain software. By embedding operations engineers into software development teams, companies are reducing the average number of days from code completion to live production and eliminating wait time and rework, among other benefits. But as I pointed out in my previous post “Performance Testing in a DevOps World” performance testing remains the weakest link in the process.