Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Challenges Faced by MSPs in Light of COVID-19

The COVID-19 crisis has proven to be a challenging time for IT support teams and managed service providers (MSPs). It hasn’t only left these organizations in a vulnerable position, but also in a state of uncertainty as to what may be in store for them. OnPage interacts with current and prospective clients ranging from large businesses to small and medium enterprises (SMEs).

Getting SRE Buy-in from a Manager or Lead for Incident Response, Part 1

Adopting SRE best practices can be difficult, especially when you need approval from managers, VPs, CTOs, and everything in between. In this blog series, we will walk you through how to come up with a winning pitch for each level of leadership to ensure that SRE buy-in will succeed in your organization. Let’s start at the beginning with your team lead or manager.

Virtualize the NOC: Accelerate Your Transition to Remote IT Ops with AIOps

The sudden shift to remote work caused by the global pandemic has forced IT Ops pros to quickly adjust in multiple ways to maintain the uptime and stability of critical digital services. Amidst this crisis, AIOps has emerged as a lifeline, as it facilitates remote collaboration, streamlines incident management, and accelerates detection and resolution.

Resilience in Action, Episode 1: Narratives in Incidents with Lorin Hochstein

Resilience in Action is a podcast about all things resilience, from SRE to software engineering, to how it affects our personal lives, and more. Resilience in Action is hosted by Blameless Staff SRE Amy Tobey. Amy has been an SRE and DevOps practitioner since before those names existed. She cares deeply about her community of SREs and wants to take what she’s learned over the 20+ years of her career to help others. In our very first episode, Amy chats with Netflix software engineer Lorin Hochstein.

Collaborate through chaos with Opsgenie's new Slack app for incidents

Get stories like this in your inbox During an IT incident, every second counts – but the first few minutes are the most critical. Teams who can rapidly spin up the right tools and processes have the best shot at fast resolution. And of course, many teams rely on chat tools to collaborate and communicate during incidents. So we’re excited to announce our new Slack app for Opsgenie Incidents.

Technology Innovation Snapshot: How Blameless Accelerates Team Performance

In Digital Enterprise Journal’s March Edition of its Technology Innovation Snapshot, Blameless was listed among 11 other companies as promising vendors. Blameless is honored to be alongside companies such as Gremlin, Catchpoint, and Moogsoft, and excited about the future DEJ sees for the SRE space.

How PagerDuty's Ecosystem Partners Are Helping People During the COVID-19 Crisis

For many of us, “working” is incredibly difficult right now. That’s true at the organizational level, where maintaining business continuity and accounting for changes in customer needs are even more critical. But it’s also true at the individual level, where the sudden shift to working from home has jolted us all into working in new ways, and made virtual collaboration an essential part of each workday.

April 2020 Update: Goodbye "I never got that alert" and emergency alerting - the new Signl Center

Our April update is BIG. It introduces emergency alerting to reach you entire team. We hope this will be a bit of humble support to your organization in this Covid-19 crisis. The core of this release is the new signl center, the new place to track alerts and their delivery in real-time, to see incoming events and how they are processed. You can now send emergency alert to your entire team with a single click.