Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

BigPanda's Event Enrichment Engine: The secret ingredient for AIOps

James Beard, the pioneer of television cooking shows, once asked, “Where would we be without salt?”. Salt is often underrated, even though it is the ingredient that has the greatest impact on food and flavor in the modern world. It has its own taste, but also balances and enhances the flavor of other ingredients. Salt boosts sweetness and blocks bitterness, it has scientifically proven capabilities to intensify flavor compounds that are too subtle to detect (i.e.

Monthly Moo Update | July 2021

We hope June was as good to you as it was to us. Our latest updates, available now, will keep you relaxing poolside this summer knowing that your monitoring, event correlation, and incident workflows are all connected and automated through the cloud. If you’re not relaxing with a little cloud coverage keeping you cool, then come check out Moogsoft to see how you can keep your services available and your customers happy, so you can get to relax with a little more time in your day.

What is a Blameless Postmortem?

Do blameless retrospectives (or postmortems) help your team? We will explain what they are, if they really work, and how to do them right. A blameless postmortem (or retrospective) is a post-incident document that helps teams figure out why an incident happened, and brainstorm how to improve the process to prevent similar incidents from happening again. In most engineering organizations, everyone agrees that in complex systems, failure is inevitable.

Using CC&C Platforms to Transform Metrics Into Valuable Insights

Healthcare institutions are increasingly implementing clinical communication and collaboration (CC&C) platforms to improve the productivity of care teams. Automated CC&C platforms perfect care orchestration plans to ensure providers have the means to satisfy the ever-changing needs of patients. Key features of CC&C platforms include real-time, secure mobile messaging and alerting; digital, intelligent on-call schedules; time-stamped message statuses; and automated alert escalations.

Error Budgets That Work for You. Plus Support for New Relic Metrics and NR Query Language

Error Budgets That Work for You. Plus Support for New Relic Metrics and NR Query Language Did you know that error budget policy is the key to making SLOs actionable? In fact, Twitter’s engineering team did not successfully adopt SLOs until they introduced error budgets. SLOs enable teams to quantify customer happiness, and error budgets enable teams to make data-backed tradeoffs between reliability and feature velocity. We believe that teams optimizing for reliability must adopt both.

Rootly Announces $3.2 Million in Seed Funding from XYZ Venture Capital, 8VC, & Y Combinator

Rootly is on a mission to create a world where maintaining reliability is frictionless, delightful, and accessible to anyone. Making resolving and learning from incidents every organizations superpower.

Cherwell Monitoring in Production

I have been working on a couple of monitoring ideas for Cherwell. I didn’t see anything with a quick online search, and I enjoy authoring MPs to monitor applications, it is the closest I’ll get to 007. I’ve hit a major hurdle and I need to ask for a hand from the community. We have a lab environment that’s worked great while developing the Cherwell integration for Connection Center, however, it is not a good simulation for an actual deployment.

July 2021 Update: Users can be members of multiple teams

The time has come! Users in SIGNL4 can now be a member of multiple teams. This allows for staff to be on duty in multiple groups or departments in parallel and to receive related alert notifications for incidents that occur in the different teams. In addition, you can now also send Signls to multiple teams. All details are available in this article.