teemi: An open-source literate programming approach for iterative design-build-test-learn cycles in bioengineering
Søren D Petersen,Lucas Levassor,Christine M Pedersen,Jan Madsen,Lea G Hansen,Jie Zhang,Ahmad K Haidar,Rasmus J N Frandsen,Jay D Keasling,Tilmann Weber,Nikolaus Sonnenschein,Michael K Jensen,Søren D. Petersen,Christine M. Pedersen,Lea G. Hansen,Ahmad K. Haidar,Rasmus J. N. Frandsen,Jay D. Keasling,Michael K. Jensen
DOI: https://doi.org/10.1371/journal.pcbi.1011929
2024-03-09
PLoS Computational Biology
Abstract:Synthetic biology dictates the data-driven engineering of biocatalysis, cellular functions, and organism behavior. Integral to synthetic biology is the aspiration to efficiently find, access, interoperate, and reuse high-quality data on genotype-phenotype relationships of native and engineered biosystems under FAIR principles, and from this facilitate forward-engineering strategies. However, biology is complex at the regulatory level, and noisy at the operational level, thus necessitating systematic and diligent data handling at all levels of the design, build, and test phases in order to maximize learning in the iterative design-build-test-learn engineering cycle. To enable user-friendly simulation, organization, and guidance for the engineering of biosystems, we have developed an open-source python-based computer-aided design and analysis platform operating under a literate programming user-interface hosted on Github. The platform is called teemi and is fully compliant with FAIR principles. In this study we apply teemi for i) designing and simulating bioengineering, ii) integrating and analyzing multivariate datasets, and iii) machine-learning for predictive engineering of metabolic pathway designs for production of a key precursor to medicinal alkaloids in yeast. The teemi platform is publicly available at PyPi and GitHub. Bioengineering holds fantastic perspectives and is poised to change how we produce foods, materials, and medicines. However, rapid progress is limited by a lack of mechanistic knowledge in even the simplest model organisms, such as bacteria and yeast. Thus, to compensate, we often have to construct and study a large number of engineered cells and select the cells with the greatest potential for the given objective function. The targeted construction of engineered cells is often described as an iterative process of design, build, test, and learn (the DBTL cycle). Literate programming is a paradigm that encourages the combination of text and computer code with the potential to describe all workflows covered by the DBTL cycle. The purpose of the present work is to give a first estimate of the extent to which we can accelerate the individual steps in the DBTL cycle by using end-to-end literate programming workflows. To achieve this, we established an open-source platform called teemi, and used it to optimize production of a key precursor to medicinal alkaloids in yeast. We expect that teemi will enable higher DBTL throughput with fewer errors, better integration of IT tools with laboratory resources, and more effective knowledge capture.
biochemical research methods,mathematical & computational biology