Exception Perceptions: Attack of the Cloned Issues (for Better Understanding User Behavior)

We’re back! That’s right, we have a new episode of Exception Perceptions to share with y’all. This is part 2 of our Star Wars series, and this time we’re talking all about using errors to better understand user behavior.

What is MTTR? How to measure and improve your Mean Time to Recovery

Complex distributed systems run just about every service imaginable. Healthcare systems that monitor patient health, security systems, and financial systems are all mission-critical. Downtime, or lack of availability, costs money and can even put lives at risk. These systems must be monitored, and many measurements help keep them running with as little downtime as possible. One of those is Mean Time to Recovery (MTTR).