Learning from and improving upon high-throughput screens for protein fitness with Generative AI - Application to BBB-crossing AAV design

Kumari Soniya,Subhalakshmi Kandavel,Anubhooti Anubhooti,Ayan Kashyap,Manasvi Sharma,Prasad Chodavarapu,Siva Kanishka
DOI: https://doi.org/10.1101/2024.12.06.627153
2024-12-11
Abstract:Deep Science is enabling high-throughput experimentation (HTE) to design novel biological entities with desired properties. E.g., blood-brain-barrier (BBB) crossing adeno- associated virus (AAV) vectors, needed for systemic delivery of gene therapies to brain cells, have been identified through innovative directed evolution assays such as M-CREATE and TRACER. But, even these high-throughput experiments are only able to explore a minuscule portion of the large design space of biological entities. In this paper, we introduce autograd based maximization of protein fitness (AutoMaxProFit) to learn from and improve upon protein designs generated with high-throughput screens. Using a transformer based generative AI network and protein language models, we improve upon the design of a variant previously discovered through HTE, to yield 4x better enrichment in brain endothelial cells, as estimated by molecular dynamics (MD) simulations. This shows that Deep Tech models can learn from the observations generated by Deep Science experiments and go on to find more optimal design candidates for application in Biopharma.
Bioinformatics
What problem does this paper attempt to address?