Latest News

Mention the on-call situation in Slack channels

Jul 10, 2024 By Falit Jain In Pagerly

Get each team's current oncall automatically and tag them into any Slack topic. To bring up the current oncall in any channel, topic, or conversation, use @Pagerly. You may also use this to automate responses. ‍

Read Post

Pagerly

Read more about Mention the on-call situation in Slack channels

Decoding Severity: A Guide to Differentiating Major vs Critical Incidents

Jul 9, 2024 By Spandan Pal In Squadcast

Recognizing the difference between major and critical incidents is essential for IT operations, as downtime can result in significant financial losses for businesses. Gartner highlights that effective incident management can cut downtime by as much as 40% . Major incidents disrupt business operations but are typically confined to specific systems or processes.

Read Post

Squadcast

Read more about Decoding Severity: A Guide to Differentiating Major vs Critical Incidents

Behind the scenes: Launching On-call

Jul 9, 2024 By Henry Course In Incident.io

March 5th was a big day for incident.io as we released our on-call product to the world. Nine months of listening to our customers, coding, fixing, testing, and polishing came together for our biggest product launch to date. Releasing On-call was a huge milestone and represented the next step in our journey as a company.

Read Post

Incident.io

Read more about Behind the scenes: Launching On-call

Align ServiceOps with incident context to meet ITOps goals

Jul 9, 2024 By Sam Osborn In BigPanda

ServiceOps is a technology-enabled approach that unifies IT operations and IT service management (ITSM) teams to improve incident management. In a recent survey of more than 400 global IT leaders by Enterprise Management Associates (EMA), 96% of respondents reported positive results from implementing the approach. Adoption rates are high: 75% have either an active effort or a formal initiative to streamline collaboration between ITSM and ITOps teams.

Read Post

BigPanda

Read more about Align ServiceOps with incident context to meet ITOps goals

Round Robin escalation policies: do's and don'ts

Jul 9, 2024 By Ashley Sawatsky In Rootly

The concept of Round Robin comes from sports. And it has nothing to do with anyone called Robin, but the french word ruban (ribbon). In a Round Robin tournament, all participants face each other by taking turns. When applied to on-call schedules, a Round Robin escalation policy means that responders assigned to a level will take turns responding to alerts. When is this strategy useful and when isn’t?

Read Post

Rootly

Read more about Round Robin escalation policies: do's and don'ts

What is an Incident Timeline and How Do You Create One?

Jul 8, 2024 By Blameless In Blameless

Incidents are unavoidable in software development and IT. As a Site Reliability Engineer (SRE), one of the tools you’ll use frequently is an incident timeline. The incident timeline provides a real-time report on any incident, including alerts, system updates, issue severity changes, manual chat entries, and more.

Read Post

Blameless

Read more about What is an Incident Timeline and How Do You Create One?

SRE vs. DevOps vs. Platform Engineering

Jul 8, 2024 By Blameless In Blameless

The age of information technology has rapidly expanded to include a wide range of necessary roles to manage and optimize operational frameworks. Site Reliability Engineers (SREs), Development Operations (DevOps), and Platform Engineers have become invaluable within this digital landscape. Here, you’ll learn more about each role, how they differ, and what they bring to the table.

Read Post

Blameless

Read more about SRE vs. DevOps vs. Platform Engineering

Onboarding yourself as an engineer at incident.io

Jul 5, 2024 By Pip Taylor In Incident.io

At incident.io we use infrastructure as code for configuring everything we can, and we feel that there’s no reason we should exclude our own product from that. As well as configuring things like Google Cloud Platform, Sentry and Spacelift via our infrastructure repo, we also configure incident.io. On your first day as an engineer here, the first PR that you make is to our infrastructure repo.

Read Post

Incident.io

Read more about Onboarding yourself as an engineer at incident.io

Runbooks vs Playbooks: Differences & How to Choose

Jul 4, 2024 By Lauren Craigie In Cortex

Are you documenting your incident response process, and are unsure which you should be writing—a runbook or a playbook? Could these be two names for the same kind of document? Read on to learn about two different and complementary structures: playbooks and runbooks. The two are used in tandem, and because the terms are sometimes used interchangeably, they can be mistaken for one another.

Read Post

Cortex

Read more about Runbooks vs Playbooks: Differences & How to Choose

On-Call Life: Setting Expectations

Jul 3, 2024 By Ritika Bramhe In OnPage

Imagine this: You’ve just been offered a new job in tech. Maybe it’s your first job right out of college, and you’ve only heard of being on-call in passing conversations up until this point. Or, perhaps you’ve been in tech your whole life but never had to be on-call until today. Or, maybe you’re contemplating whether on-call is for you because your company is dangling some extra cash (because, who doesn’t like extra money!).

Read Post

OnPage

Read more about On-Call Life: Setting Expectations

Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Mention the on-call situation in Slack channels

Decoding Severity: A Guide to Differentiating Major vs Critical Incidents

Behind the scenes: Launching On-call

Align ServiceOps with incident context to meet ITOps goals

Round Robin escalation policies: do's and don'ts

What is an Incident Timeline and How Do You Create One?

SRE vs. DevOps vs. Platform Engineering

Onboarding yourself as an engineer at incident.io

Runbooks vs Playbooks: Differences & How to Choose

On-Call Life: Setting Expectations

Monthly Archive

Follow Us