A versatile active learning workflow for optimization of genetic and metabolic networks
Amir Pandi, Christoph Diehl, Ali Yazdizadeh Kharrazi, Léon Faure, Scott A. Scholz, Maren Nattermann, David Adam, Nils Chapin, Yeganeh Foroughijabbari, Charles Moritz, Nicole Paczia, Niña Socorro Cortina, Jean-Loup Faulon, Tobias J. Erb
DOI: https://doi.org/10.1101/2021.12.28.474323
2021-12-28
Abstract: The study, engineering and application of biological networks require practical and efficient approaches. Current efforts to optimize these systems are often limited by wet-lab labor and cost, as well as by the lack of convenient, easily adoptable computational tools. Aiming at democratization and standardization, we describe METIS, a modular and versatile active machine learning workflow with a simple online interface for the optimization of biological target functions with minimal experimental datasets. We demonstrate our workflow for various applications, from simple to complex gene circuits and metabolic networks, including several cell-free transcription and translation systems, a LacI-based multi-level controller and a 27-variable synthetic CO₂-fixation cycle (CETCH cycle). Using METIS, we improved these systems by one to two orders of magnitude compared to their original setups with minimal experimental effort. For the CETCH cycle, we explored the combinatorial space of ∼10²⁵ conditions with only 1,000 experiments to yield the most efficient CO₂-fixation cascade described to date. Beyond optimization, our workflow also quantifies the relative importance of individual factors to the performance of a system. This makes it possible to identify previously unknown interactions and bottlenecks in complex systems, which paves the way for their hypothesis-driven improvement. We demonstrate this for the LacI multi-level controller, which we were able to improve 34-fold after identifying resource competition as the limiting factor. Overall, our workflow opens the way for convenient optimization and prototyping of genetic and metabolic networks with customizable adjustments according to user experience, experimental setup, and laboratory facilities.
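To make the idea of an active learning loop concrete, the sketch below shows one generic way such a workflow can be organized: measure a small batch of conditions, fit a model ensemble to the results, then pick the next batch by trading off predicted performance against model disagreement. This is an illustration only and is not taken from METIS. The toy objective `measure`, the bootstrap nearest-neighbor ensemble, the `kappa` exploration weight, and all function names are assumptions introduced here.

```python
import itertools
import random
import statistics


def measure(condition):
    """Hypothetical stand-in for a wet-lab measurement (toy objective, not real data)."""
    target = (3, 1, 4)  # assumed optimum of the toy objective
    return -sum((c - t) ** 2 for c, t in zip(condition, target))


def knn_predict(train, x, k=3):
    """Predict the outcome at x as the mean of the k nearest measured conditions."""
    nearest = sorted(
        train, key=lambda item: sum((a - b) ** 2 for a, b in zip(item[0], x))
    )[:k]
    return statistics.fmean(y for _, y in nearest)


def ensemble_predict(train, x, rng, n_models=10):
    """Bootstrap ensemble: return mean prediction and spread (an uncertainty proxy)."""
    preds = []
    for _ in range(n_models):
        sample = [rng.choice(train) for _ in train]  # resample with replacement
        preds.append(knn_predict(sample, x))
    return statistics.fmean(preds), statistics.pstdev(preds)


def active_learning(levels=5, n_factors=3, rounds=5, batch=8, kappa=1.0, seed=0):
    """Run a few measure/learn/suggest rounds over a small discrete condition grid."""
    rng = random.Random(seed)
    pool = list(itertools.product(range(levels), repeat=n_factors))
    tested = {}

    # Round 0: start from randomly chosen conditions.
    for c in rng.sample(pool, batch):
        tested[c] = measure(c)

    for _ in range(rounds):
        train = list(tested.items())
        scored = []
        for c in pool:
            if c in tested:
                continue
            mean, std = ensemble_predict(train, c, rng)
            # Favor high predictions (exploitation) and high disagreement (exploration).
            scored.append((mean + kappa * std, c))
        scored.sort(reverse=True)
        for _, c in scored[:batch]:
            tested[c] = measure(c)

    return tested


if __name__ == "__main__":
    results = active_learning()
    best = max(results, key=results.get)
    print(f"tested {len(results)} of {5 ** 3} conditions; best so far: {best}")
```

With the default arguments the loop measures 48 of the 125 grid points. The same structure carries over to larger spaces, where exhaustive testing is impossible and the choice of which conditions to measure next matters most.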