
Latest Videos

Performing chaos in a serverless world - Gunnar Grosch - Failover Conf 2020

Chaos engineering is the practice of hypothesis testing through planned experiments to gain a better understanding of a system's behavior. The principles of chaos engineering have been around for years, and we have now reached the point where it has gone from being a buzzword and a practice used by a few large organizations in very specific fields to being put into use by companies of all sizes and industries.
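The hypothesis-driven loop described above can be illustrated with a minimal sketch, not taken from the talk: the service call, the fault injection, and the 500 ms threshold are all hypothetical stand-ins for a real experiment's steady-state hypothesis, injected fault, and measured outcome.

```python
import random
import time

# Hypothetical service call used only for illustration; a real experiment
# would inject the fault into an actual dependency (network, disk, etc.).
def call_service(inject_latency=False):
    if inject_latency:
        time.sleep(random.uniform(0.1, 0.3))  # simulated fault injection
    time.sleep(0.01)                           # nominal work
    return "ok"

def p95(samples):
    samples = sorted(samples)
    return samples[int(len(samples) * 0.95)]

# Steady-state hypothesis (assumed): p95 latency stays under 500 ms
# even while the fault is being injected.
baseline, experiment = [], []
for _ in range(100):
    t0 = time.perf_counter(); call_service(); baseline.append(time.perf_counter() - t0)
    t0 = time.perf_counter(); call_service(inject_latency=True); experiment.append(time.perf_counter() - t0)

print(f"baseline p95:   {p95(baseline):.3f}s")
print(f"experiment p95: {p95(experiment):.3f}s")
print("hypothesis holds" if p95(experiment) < 0.5 else "hypothesis violated")
```

The experiment either confirms the hypothesis or reveals a gap in the system's resilience; either outcome improves understanding of how the system behaves under failure.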

Swim Don't Sink: Why Training Matters to a Site Reliability Engineering Practice - Jennifer Petoff

Do you offer training to the engineers in your organization, or do you throw them into the deep end to "sink or swim"? Providing training and education sets team members up for success and is critical for establishing a thriving Site Reliability Engineering (SRE) or DevOps practice and culture in the first place.

Fight, Flight, or Freeze - Releasing Organizational Trauma - Matt Stratton - Failover Conf 2020

When humans are faced with a traumatic experience, our brains kick in with survival mechanisms. These mechanisms are most familiar as the fight-or-flight response, but they can also include the freeze response, which occurs when we are terrified or feel there is no chance of escape.

Y2K and Other Disappointing Disasters: Risk Reduction and Harm Mitigation - Heidi Waterhouse

Every disaster is a concatenation of smaller failures. How can we design software and processes to accept that we live in an imperfect world? Explore the concepts of resiliency, harm reduction, over-engineering, and planning for failure with real examples.

How to fail with Serverless - Jeremy Daly - Failover Conf 2020

Everything fails all the time. Knowing how to deal with these failures in serverless applications becomes essential to building resilient, highly available systems. In traditional monolithic applications, catching errors and handling retries is relatively straightforward. But as our systems become more distributed, we now have multiple (often asynchronous) components processing events from several sources, all with vastly different retry behaviors and failure mechanisms. Applying the old patterns can cause errors to get swallowed, creating brittle, unreliable systems that are difficult to debug and hard to maintain.
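As a rough illustration of the kind of defensive handling this describes, here is a minimal Python sketch; the handler shape, `process_record`, and the dead-letter stub are hypothetical and not taken from the talk. It retries transient failures with backoff and routes records that still fail to a dead-letter path instead of silently swallowing them.

```python
import json
import time

MAX_RETRIES = 3

class TransientError(Exception):
    """Raised for failures worth retrying (e.g. throttling, timeouts)."""

def process_record(record):
    # Hypothetical business logic; a real handler would call downstream services here.
    if record.get("force_fail"):
        raise TransientError("downstream throttled")
    return {"status": "processed", "id": record.get("id")}

def send_to_dead_letter(record, error):
    # Stand-in for publishing to a dead-letter queue for later inspection and replay.
    print(f"DLQ: {json.dumps(record)} failed with {error!r}")

def handler(event, context=None):
    """Lambda-style entry point: process each record, retry transient errors,
    and never let one bad record silently disappear."""
    results = []
    for record in event.get("records", []):
        for attempt in range(1, MAX_RETRIES + 1):
            try:
                results.append(process_record(record))
                break
            except TransientError as err:
                if attempt == MAX_RETRIES:
                    send_to_dead_letter(record, err)   # surface the failure, don't swallow it
                else:
                    time.sleep(2 ** attempt * 0.1)     # exponential backoff before retrying

if __name__ == "__main__":
    print(handler({"records": [{"id": 1}, {"id": 2, "force_fail": True}]}))
```

The key design choice is making failure paths explicit: every record either succeeds, is retried within a bounded budget, or ends up somewhere observable for later replay.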