Latest Videos

Why Gremlin: Today's complex applications need a different approach to reliability

Dec 16, 2024 By Gremlin In Gremlin

Cloud-based distributed applications have changed how we need to approach reliability and resiliency. How do you make your applications reliable? Here’s Gremlin CEO Josh Leslie to tell you how. Today’s dynamic applications are too complex, and change too constantly, for humans to wrap their heads around. This means the reliability approaches that worked ten years ago simply won’t be enough. As a technology company (and these days, every company is a technology company), you need to take a different, programmatic approach to testing and improving the reliability of your applications.

Test for the common failures that cause 80% of outages with Gremlin

Dec 16, 2024 By Gremlin In Gremlin

80% of failures at the infrastructure layer come from the same core gaps in reliability. Jeff Nickoloff, Gremlin Principal Engineer, goes over how Reliability Management test suites help improve reliability across your organization. Are you waiting for the other reliability shoe to drop and hoping that you actually fixed core resilience issues? Or do you know for sure that you’re resilient to common reliability issues?

Integrating Gremlin with your observability tools

Nov 14, 2024 By Gremlin In Gremlin

Part of the Gremlin Office Hours series: A monthly deep dive with Gremlin experts. To get the most value out of Chaos Engineering and reliability testing, you need a way to observe your service’s behavior. Observability tools offer insight into how your systems are performing, but observability on its own isn’t enough. You need a way to monitor your systems while testing their reliability so you can determine whether your service passed or failed a test.

Building Resilience from Architecture to Production with AWS & Gremlin

Nov 8, 2024 By Gremlin In Gremlin

Unreliable software can have a painful impact on your customers and your business, something we’ve all seen and felt during high-profile outages. And while building on the cloud with AWS unlocks improved scaling and reliability capabilities, the complexity of modern distributed systems can introduce reliability risks that cause outages. How can you be sure your systems are resilient to failure when they’re based on complex architecture, built by hundreds of teams, and updated almost constantly?

Office Hours: How to test serverless applications using Failure Flags

Oct 10, 2024 By Gremlin In Gremlin

Part of the Gremlin Office Hours series: A monthly deep dive with Gremlin experts. Serverless platforms are ideal for deploying scalable applications without having to manage infrastructure. However, not having access to the infrastructure also makes it difficult to test their reliability. It’s easy to simulate a network outage or latency when you have direct access to the host that your software’s running on. What do you do when you only have control over the code?

How Visa Cross Border Solutions Reduces Outages by Testing System Resilience in Their SDLC

Oct 7, 2024 By Gremlin In Gremlin

For global financial services companies, reliability must be built in and validated before and after shipping to production. Resilience testing is crucial for verifying the reliability of your applications under real-world conditions. But ad-hoc testing and exploratory experiments aren't sufficient: you need to run automated, standardized tests at global scale.

Office Hours: Get better reliability on AWS with our new release

Sep 12, 2024 By Gremlin In Gremlin

Part of the Gremlin Office Hours series: A monthly deep dive with Gremlin experts. Cloud platforms make it easier than ever to deploy massively scalable, distributed workloads, but this is a double-edged sword. There are reliability challenges unique to the cloud that didn’t exist before. Failed migrations, recurring incidents, and reliability toil take their toll.

Achieving SLO Success with Golden Signals and Reliability Testing

Aug 28, 2024 By Gremlin In Gremlin

The four Golden Signals are an easy and effective way to measure the most important aspects of a system, and when paired with a reliability management platform like Gremlin, they help you proactively meet your SLOs so you can meet your legal obligations and deliver the perfect customer experience.

5 essential resilience tests for a successful cloud migration

Aug 8, 2024 By Gremlin In Gremlin

Part of the Gremlin Office Hours series: A monthly deep dive with Gremlin experts. Migrating to the cloud usually means faster deployments and easier scalability, but it also means latency. Cloud applications communicate over distributed networks, and while these networks are fast, little bits of latency can quickly add up.

Are you testing for known reliability vulnerabilities?

Aug 1, 2024 By Gremlin In Gremlin

"Risks have different priorities, but ultimately we want to be aware of those risks. Just like we want our security team to go scan for known vulnerabilities, our reliability team should be scanning for known vulnerabilities. And those are easy things we should go address. There's a second part of it, which is kind of just good engineering testing, which is: Hey, we have a set of test cases that we know need to pass.
