Latest News

Using incidents to level up your teams

Aug 31, 2022 By Lisa Karlin Curtis In Incident.io

I joined GoCardless as a junior engineer. It was one of my first coding jobs, and in my time there I progressed to senior much faster than I had expected. When I reflect on how this happened, one pattern stands out to me: the big step changes in my understanding, and in my ability to solve larger and more complex engineering problems, came as a result of incidents.

What's New: Updates to PagerDuty Process Automation Software & PagerDuty Runbook Automation, Integrations, and More!

Aug 31, 2022 By Vera Chan In PagerDuty

We’re excited to announce a new set of updates and enhancements to the PagerDuty Operations Cloud. Recent development and app updates from the product team include PagerDuty® Process Automation, our Partner Integrations and App Ecosystem, as well as Community & Advocacy Events updates. We continue to help customers automate everywhere to optimize cloud operations and reduce the number of issues escalated to other teams.

Software Metrics Every SRE Team Should Measure

Aug 31, 2022 By Myra Nizami In Blameless

Software metrics give important insight into the performance of your product, but which ones matter most to SRE teams? How do you decide which metrics to track?

RESOLVE '22: Bit by bit

Aug 30, 2022 By Ronnel Vergara In BigPanda

It is difficult to define a single, solid maturity model for IT Operations. As moderator Jason Walker, BigPanda’s COO, said in our RESOLVE ’22 event Bit by bit, maturity models in “almost every other domain of IT” have not turned into a workable set of guideposts and indicators in the Ops domain. We welcomed Insurity’s Lead Cloud Operations Performance & Monitoring Admin, Ronnel Vergara, to take the stage and talk over this high-level topic at our event.

Round Robin Escalation: An Efficient Way to Distribute On-Call Responsibilities

Aug 30, 2022 By Vishal Padghan In Squadcast

Organizations now handle a high volume of incidents every day. With so much happening, responders can be overwhelmed and may end up de-prioritizing important incidents. It is therefore important to have an efficient on-call scheduling and escalation process in place. In this blog, we will explore how Round Robin Escalations can help distribute on-call load and set up efficient on-call schedules.

Bridging the gap between Engineering and Customer Support during incidents

Aug 30, 2022 By incident.io In Incident.io

Customer trust and satisfaction are the most important currency your business can own. No matter how brilliant your product, without happy customers your business will struggle. When everything is running smoothly, it’s easy to feel that heady dose of customer love. It’s when things break during an incident that these relationships are really put to the test.

The Five Main Components of a Fully Developed EHR System

Aug 30, 2022 By OnPage Corporation In OnPage

The adoption of electronic health record (EHR) systems has seen tremendous growth across geographies, especially in the US. According to American Hospital Association data shared by the Office of the National Coordinator for Health Information Technology, over 93% of American hospitals use some form of EHR. Implementing an EHR system in your clinic or hospital is a big decision.

Get started with Grafana OnCall and Terraform

Aug 29, 2022 By Innokentil Konstantinov In Grafana

Managing on-call schedules and escalation chains, especially across many teams, can get cumbersome and error-prone. This can be especially difficult without as-code workflows. Here on the Grafana OnCall team, we’re focused on making Grafana OnCall as easy to use as possible. We want to make it easier to reduce errors with your on-call schedules, create schedule and escalation templates quickly, and fit on-call management into your existing as-code patterns.

Healthchecks + Squadcast Integration: Routing Alerts Made Easy

Aug 26, 2022 By Vishal Padghan In Squadcast

Healthchecks is a cron job monitoring service that listens for HTTP requests and email messages ("pings") from your cron jobs and scheduled tasks ("checks"). You update your job to send an HTTP request to the ping URL every time the job runs. If your job does not ping Healthchecks.io on time, you receive an alert. If you use Healthchecks for your monitoring needs, you can now integrate it with Squadcast to route detailed alerts from Healthchecks to the right users in Squadcast.

What are Runbooks? And why are they needed?

Aug 25, 2022 By Vardhan NS In Squadcast

Imagine being an Ops engineer in a team just struck by tragedy. Alarms start ringing, and incident response is in full force. It may sound like the situation is under control. WRONG! There's panic everywhere. The on-call team is scrambling for the heavenly door to redemption. But the one thing that doesn't stop: stakeholder inquiries. This situation is bad. But it could be worse. Now imagine being a less-experienced Ops engineer in a relatively small on-call team struck by tragedy. If you don't have sufficient guidance, let alone moral support, you're toast.
