Alerting

Synced for Success: OnPage & Slack for Incident Response

As the post-pandemic world finds its footing again, a resilient spirit drives the revival, propelling businesses to embrace a new era of technological innovation. Notably, IT teams are rapidly digitizing their processes, particularly in incident response. From virtual collaboration tools and remote IT support to automated incident management, teams have found innovative ways to ensure seamless business continuity while delivering IT services with minimal downtime.

Evolution of Site Reliability - Incidentally Reliable with Manoj Sebastian

Catch Manoj Sebastian (ex-Flipkart, Amazon, Atlassian, Intuit, Yahoo) as he talks about the evolution of SRE over 20 years, incident response and post-incident culture at Big Tech, and the future of reliability with AI ramping up at full speed. The freshest podcast for Site Reliability Engineers, hosted by Vishwa and Shubham from Zenduty.

Automatic log level detection reduces your cognitive load to identify anomalies at 3 am

Let’s face it: when that alert goes off at 2:58 a.m., abruptly shaking you out of a deep slumber because a high-priority issue has hit the application, you’re not 100% “on”. You need to shake the fog out of your head to focus on the urgent task of fixing the problem. This is where having the best log analytics tool can take on some of that cognitive load. Sumo Logic recently released new features specific to our Log Search queries that automatically detect log levels.

What Is Adaptive Thresholding?

Adaptive thresholding is a term used in computer science and, more specifically, in IT Service Intelligence (ITSI) for analyzing historical data to set thresholds for key performance indicators (KPIs) in your IT environment. Among other things, it’s used to govern KPI outliers in an effort to foster more meaningful and trusted performance monitoring alerts.
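
To make the idea concrete, here is a minimal sketch of one common way to derive thresholds from history: group past KPI readings by hour of day and flag values that fall outside the mean plus or minus `k` standard deviations for that hour. This is an illustrative assumption, not the algorithm used by ITSI or any particular product; the function names, the hourly grouping, the sample data, and the choice of `k = 3` are all hypothetical.

```python
from collections import defaultdict
from statistics import mean, stdev


def adaptive_thresholds(history, k=3.0):
    """Build a per-hour (lower, upper) band from historical KPI readings.

    history: iterable of (hour_of_day, value) pairs.
    k: band width in standard deviations (an illustrative default).
    """
    by_hour = defaultdict(list)
    for hour, value in history:
        by_hour[hour].append(value)

    thresholds = {}
    for hour, values in by_hour.items():
        if len(values) < 2:
            continue  # not enough history to estimate a spread
        mu = mean(values)
        sigma = stdev(values)
        thresholds[hour] = (mu - k * sigma, mu + k * sigma)
    return thresholds


def is_outlier(thresholds, hour, value):
    """Return True when value falls outside the learned band for that hour."""
    band = thresholds.get(hour)
    if band is None:
        return False  # no baseline for this hour, so do not alert
    lower, upper = band
    return value < lower or value > upper


# Hypothetical history: requests per second at 3 a.m. and at 2 p.m.
history = (
    [(3, v) for v in (10, 12, 11, 9, 10)]
    + [(14, v) for v in (100, 110, 95, 105, 102)]
)
thresholds = adaptive_thresholds(history)

# The same reading is normal in the afternoon but anomalous overnight.
afternoon_alert = is_outlier(thresholds, 14, 100)
overnight_alert = is_outlier(thresholds, 3, 100)
```

A single static threshold would either miss the overnight spike or fire constantly in the afternoon; a band learned per time slot avoids both. Real deployments typically use longer windows and more robust statistics, such as quantiles, than this sketch does.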

AIOps & Observability Market Trends and Insights

Join Ron Williams, Principal Analyst at GigaOm, and Shailesh Manjrekar, Chief Marketing Officer at CloudFabrix, in examining the market trends of AIOps & Observability. We will also dive deep into the recent GigaOm Radar and discuss "Why is CloudFabrix one of the only two Outperformers for GigaOm Radar?"

In review: Gartner Hype Cycle for Monitoring and Observability

You know it’s going to be a great day when you find yourself mentioned as a sample vendor in Gartner’s widely read Hype Cycle report. The OnPage team is thrilled to share with its community that Gartner has named us a sample vendor in its latest Hype Cycle for Monitoring and Observability, specifically within the Automated Incident Response category. The mention continues OnPage’s impressive streak this year.

Datadog and BigPanda: Observability and AIOps made better together

Datadog’s modern observability empowers development engineers with full-stack visibility, comprehensive instrumentation generation, and proactive alerts to accelerate software releases and address potential incidents. While Datadog gives teams end-to-end visibility, it works even better with AIOps from BigPanda, which gives development teams insight into external application dependencies and reliance on other systems.

How summertime turns up the heat on cyber readiness (and what to do about it)

“Malicious cyber actors aren’t making the same holiday plans as you.” (CISA & FBI) Summertime is prime time for cyberattacks. According to one survey, 58% of security professionals believe that there is seasonality in the attacks that their company experiences every year, with the majority citing summer as high season for breaches.

Optimizing Resource Scheduling and Planning in Healthcare

The pandemic has exacerbated the staff shortage in healthcare, placing a disproportionate burden on the industry and underscoring the significance of effective resource scheduling. While resource scheduling encompasses the allocation of healthcare staff as well as physical resources and assets, our primary focus in this blog post will be on healthcare staff. Resource scheduling plays a vital role in ensuring the smooth operation of healthcare facilities.