Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Real Talk webinar recap: analytics and reporting maturity

MTTR, or mean time to resolve, is an important key performance indicator for incident response teams to track, but it’s rarely useful for technological stakeholders or customers. To really make use of the data at their disposal, decision-makers must tailor the info they provide—and understand the scope and granularity of the data they have when they deploy an AIOps platform like ours. That’s the gist of our latest Real Talk webinar on analytics and reporting maturity.

5 Reliability Insights That Immediately Transform Your SRE

As infrastructure engineers, there’s so much you can learn from studying past incidents. Luckily, Blameless Reliability Insights helps you find patterns that better equip you to deal with incidents to come. If you’ve never used it before and you’re curious what it looks like, you can watch a video demo here! These statistical insights give you the power to learn everything you can when something goes wrong. ‍

Zenduty's Commitment to Security; Soc 2 Type 2

Security is a major requirement while dealing with SAAS companies across the globe. As a service provider to leading companies globally YellowAnt is fully committed to provide the best in class security compliance, in lieu of that we on May 31, 2022 have become Soc2 Type II compliant. It is integral to maintain our customers’ trust by keeping their data safe and secure.

How To Build an Escalation Policy for Effective Incident Management

Regardless of your organization’s size, industry, or security measures, you will inevitably face IT incidents. But what do you do if an incident affects a critical system and your on-call responders can’t resolve it? Does your team have a set of clearly outlined next steps they should take to handle the issue? Answering these questions can be complicated, even more so for large organizations that rely on cloud-based services to fuel their IT environment.

StatusCast expands product offering with Incident Management for IT Platform

May 31, 2022 – Columbia, MD – StatusCast today announced the release of its IT Incident Management service, expanding its flagship offering from best-of-breed Status Page services to include the full incident management life-cycle. The new offering goes beyond standard status updates, allowing IT teams to respond faster and with more effectiveness when systems fail or go offline.

What's New: Updates to Incident Response, AIOps, Pagerduty Process Automation, and More!

Summit’s right around the corner (have you registered yet?) but the shipping doesn’t stop! We’re excited to announce a new set of updates and enhancements to PagerDuty’s Digital Operations Platform. Recent updates from the product team include On-Call Management, Incident Response, Process Automation, and Integrations, to PagerDuty Community & Advocacy Events. New capabilities enable users and customers to resolve incidents faster, do the following, and more.

Getting AWS CloudTrail alerts via SNS Endpoint

Logging and auditing have been an essential part of troubleshooting application and infrastructure performance. You can instantly spot areas of risk to ensure quick correction and prevention of issues. In this blog, we will explore the AWS CloudTrail service and discuss how integrating it with Squadcast can help you route alerts to the right users for quick and efficient incident response. Let's get started.
Sponsored Post

Simplifying SLO and Error Budget tracking for SRE teams

Service level objectives (SLOs), and the subsequent service level indicators (SLIs) are the foundation to establishing a strong SRE culture and how they promote accountability, trust and timely innovation. We are on a mission to simplify SLO and Error Budget tracking and with that aim in mind, we have added the SLO Tracker feature to the Squadcast platform. SLO Tracker seeks to provide a simple and effective way to keep track of your error budget burn rate without the hassle of configuring and aggregating multiple data sources.

5 Tips If You're the 1st SRE Hire by Instacart's First SRE

Site Reliability Engineers (SREs) have a considerable set of tasks to juggle no matter where they work or how long their company has had an SRE practice. But if you’re the very first SRE to join an organization – as many SREs are these days, given that the SRE trend is trickling down into smaller and smaller companies – you face a special group of challenges. You may find it difficult to get buy-in for SRE from other technical teams.