Webinar On Demand

Using Chaos Engineering To Ensure Kubernetes Reliability

Recorded February 15, 2022

View a Complimentary Webinar Sponsored by Gremlin

Kubernetes can dramatically improve your resource efficiency, save on compute costs, and accelerate software delivery. But the tradeoff for these is a more dynamic environment that can greatly increase the complexity of your systems and put you at risk of prolonged outages when incidents occur.

In this session, we’ll discuss how to design your services to take advantage of Kubernetes’s resiliency features and how you can achieve your reliability goals using Chaos Engineering.

We’ll cover:

  • Balancing reliability and complexity when using Kubernetes
  • Chaos Engineering as a critical component of reliability engineering
  • How to build a Chaos Engineering practice
  • Fundamental Chaos Engineering experiments to establish a solid foundation for reliability

Speaker

Jason Yee , Director of Advocacy, Gremlin
Jason Yee is Director of Advocacy at Gremlin where he helps people build more resilient systems by learning from how they fail. He also leads the internal Chaos Engineering practices to make Gremlin more reliable. Previously, he worked at Datadog, O’Reilly Media, and MongoDB. His pandemic-coping activities include drinking whiskey, cooking everything in a waffle iron, and making chocolate.