A Platform for Automating Chaos Experiments

Ali Basiri, Aaron Blohowiak, Lorin Hochstein, Casey Rosenthal
DOI: https://doi.org/10.1109/ISSREW.2016.52
2017-02-20
Abstract: The Netflix video streaming system is composed of many interacting services. In such a large system, failures in individual services are not uncommon. This paper describes the Chaos Automation Platform, a system for running failure injection experiments on the production system to verify that failures in non-critical services do not result in system outages.
Software Engineering