Latest News

Revisited: How to Run a 24×7 MSP

Dec 3, 2019 By Christopher Gonzalez In OnPage

As 2019 comes to an end, OnPage would like to re-inform MSP teams about the value and importance of offering a 24×7 support service. Twenty-four seven support ensures that client issues are quickly resolved by an after-hours support team. Though 24×7 support is a must-have offering, MSPs must first re-work their internal workflows and policies, ensuring that after-hours servicing is a pain-free venture.

Read Post

OnPage

Read more about Revisited: How to Run a 24×7 MSP

What Is MTBF? Mean Time Between Failures Explained in Detail

Dec 3, 2019 By Carlos Schults In XpoLog

Time for another installment in the series where we explain in detail yet another important metric for tech organizations. After covering MTTD and MTTF, today we answer the question, “What is MTBF?” As the post title makes clear, MTBF stands for “Mean time between failures.” The acronym refers—like the others that came before it—to an important DevOps KPI. But what actually is it? What is it good for? How do I implement it?

Read Post

XpoLog

Read more about What Is MTBF? Mean Time Between Failures Explained in Detail

Danny Mican on his experience as an SRE at Auth0

Dec 2, 2019 By Prakya Vasudevan In Squadcast

Danny is an SRE at Auth0 and currently manages the reliability of systems that authenticate over 2.5 billion logins per month and is expected to have 99.9% (Three Nines) availability. He loves learning about systems and making changes that positively impact client happiness, employee happiness and long term stability and growth.

Read Post

Squadcast

Read more about Danny Mican on his experience as an SRE at Auth0

Opsgenie's Microsoft Teams integration is now available in Microsoft AppSource

Dec 2, 2019 By Shaun Pinney In Opsgenie

Utilizing ChatOps for issue resolution isn’t new, but the benefits of using a single tool for communicating and resolving issues gives it lasting power. The ChatOps model enables teams to take action on their day-to-day work directly from collaboration platforms, including Microsoft Teams. Since many Dev and ITOps folks are using Microsoft Office 365 for their daily work, it was a natural next step for Opsgenie to align with Microsoft Teams.

Read Post

Opsgenie

Read more about Opsgenie's Microsoft Teams integration is now available in Microsoft AppSource

Learn Your Organization's Potential ROI With PagerDuty by Using IDC's Snapshot Tool

Dec 2, 2019 By Jerry Weltsch In PagerDuty

Recently, I wrote about an IDC business value study PagerDuty commissioned and shared some of the results from the research. In summary, after in-depth interviews with eight enterprise customers, IDC applied its proven business value methodology to the aggregated results of those interviews and found that enterprise customers were averaging a three-year return-on-investment (ROI) of 731% and a payback period (break-even point) on their investment in just 4.3 months.

Read Post

PagerDuty

Read more about Learn Your Organization's Potential ROI With PagerDuty by Using IDC's Snapshot Tool

50% cost-savings by automating alarm dispatching at Aquafin

Dec 2, 2019 By Derdack In Derdack

Aquafin is a Belgian company with over 1,000 employees that was established by the Flemish Region in 1990 for the purpose of expanding, operating and pre-financing the wastewater treatment infrastructure in Flanders. Aquafin collects household wastewater from the municipal sewers and transports it to wastewater treatment plants, where it is treated in accordance with European and Flemish standards.

Read Post

Derdack

Read more about 50% cost-savings by automating alarm dispatching at Aquafin

Why incident response automation is top-of-list for CISOs in 2020

Nov 30, 2019 By Noam Morginstin In Exigence

When considering the state of critical incidents in 2019 – it’s no surprise that looking ahead to 2020, CISOs have one of the organization’s most challenging and stressful jobs. During the first half of the year alone 4.1 billion records were compromised, and the average cost of a data breach is now estimated at $3.92 million.

Read Post

Exigence

Read more about Why incident response automation is top-of-list for CISOs in 2020

On-call doesn't have to be stressfull

Nov 29, 2019 By Amrit Balraj In Zenduty

“Being on-call is a critical duty that many operations and engineering teams must undertake to keep their services reliable and available. However, there are several pitfalls in the organization of on-call rotations and responsibilities that can lead to serious consequences for the services and the teams if not avoided.

Read Post

Zenduty

Read more about On-call doesn't have to be stressfull

The Age of Service Mesh

Nov 28, 2019 By Gigi Sayfan In Squadcast

You have built a massively successful system. The users just can't get enough and request new features. Your developers crank out new services on a regular basis. Your DevOps/SRE team configures and scale your Kubernetes cluster (or clusters). As the system becomes more complicated and sophisticated you realize that there are common themes that repeat across all your services.

Read Post

Squadcast

Read more about The Age of Service Mesh

Improving Postmortem Practices with Veteran Google SRE, Steve McGhee

Nov 26, 2019 By Blameless In Blameless

For many SREs, Google’s 99.999% availability seems like an untouchable dream. If anything, getting out of pager hell is already worth celebrating with all your coworkers, friends, and family on the moon. How can teams climb out of it? How can you get to a stage where you have time to proactively prevent incidents, and enter a mental state of calm and control? The rope out of pager hell is weaved with a thorough and rigorous postmortem process.

Read Post

Blameless

Read more about Improving Postmortem Practices with Veteran Google SRE, Steve McGhee

Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Revisited: How to Run a 24×7 MSP

What Is MTBF? Mean Time Between Failures Explained in Detail

Danny Mican on his experience as an SRE at Auth0

Opsgenie's Microsoft Teams integration is now available in Microsoft AppSource

Learn Your Organization's Potential ROI With PagerDuty by Using IDC's Snapshot Tool

50% cost-savings by automating alarm dispatching at Aquafin

Why incident response automation is top-of-list for CISOs in 2020

On-call doesn't have to be stressfull

The Age of Service Mesh

Improving Postmortem Practices with Veteran Google SRE, Steve McGhee

Monthly Archive

Follow Us