Operations | Monitoring | ITSM | DevOps | Cloud

Alerting

3 Things We Learned from EMA About AIOps and the Automation Handshake

AIOps is the trendy cool new kid on the block in the IT operations world. No doubt about it. However, with all the buzz surrounding AIOps, it’s easy to skip over some of the basics. How many IT operations professionals can clearly define what AIOps is? Beyond the baseline definition, why should you care? What about plugging it into your existing automation and analytics ecosystem?

How do we Apply SRE Outside of Engineering with Google's Dave Rensin

The first keynote speaker, he is a senior director of engineering at Google. You might know him as they guy who founded and leads the customer reliability engineering function at Google. CRE, this is a team that teaches the world SRE principles and practices. Now I want to tell you a bit more about him, because I think he has a very unique view and perspective. He is deeply compassionate and intuitive as a teacher, not just a lecturer.

Latest Advancements Made to OnPage

OnPage’s incident alert management platform continues to evolve, providing unique and powerful capabilities to business clients. Latest advancements include live call routing reporting and a sophisticated dashboard for enterprise users. The capabilities enhance team transparency and performance, improving incident management and collaboration in the process. In this blog post, I’ll discuss the benefits of the features and how they improve workflows.

New in Grafana 6.6: Forcing minimum alert evaluation frequency

There has long been a request from administrators to have the ability to enforce a minimum interval between alert rule evaluations. This is useful for restricting unrealistic user-defined alert rules that evaluate too often and create unnecessary load in the backend. @Uepoch took the initiative and made all the necessary modifications for this configuration in Grafana’s backend, and we finally pushed it forward and introduced the feature in Grafana v6.6.

BYOD, Secure Messaging and HIPAA Compliance

Nine in 10 hospitals have already made or are making significant investments in smartphones. However, with the emergence of this trend, the threat of non-secure messaging has been on the rise. Our latest eBook discusses the potential risks of this rising trend and possible solutions to remain HIPAA-compliant, all while streamlining clinical communications and operations. Learn how your organization can practice safe and secure messaging in order to enhance the healthcare experience today.

3 Reasons Why Machine Learning Anomaly Detection is Critical for eCommerce

Do you still find yourself visually monitoring dashboards for anomalies? That leaves catching revenue-related issues to chance. It’s become humanly impossible to catch incidents on streaming data. This is why many eCommerce and data-driven companies have adopted automated anomaly detection.

Flip your thinking to find the right incident management KPIs

Setting and tracking key performance indicators based on the right data can help incident management teams reduce the impact of incidents and strengthen the business. But what exactly is the right data? That can be a deceptively tricky question. Incidents are complex, and no two are exactly the same – and your KPIs must reflect this complexity.