Accelerating drug repurposing for COVID-19 via modeling drug mechanism of action with large scale gene-expression profiles

Lu Han,G. C. Shan,B. F. Chu,H. Y. Wang,Z. J. Wang,S. Q. Gao,W. X. Zhou,G.C. Shan,B.F. Chu,H.Y. Wang,Z.J. Wang,S.Q. Gao,W.X. Zhou
DOI: https://doi.org/10.48550/arXiv.2005.07567
2020-05-15
Quantitative Methods
Abstract:The novel coronavirus disease, named COVID-19, emerged in China in December 2019, and has rapidly spread around the world. It is clearly urgent to fight COVID-19 at global scale. The development of methods for identifying drug uses based on phenotypic data can improve the efficiency of drug development. However, there are still many difficulties in identifying drug applications based on cell picture data. This work reported one state-of-the-art machine learning method to identify drug uses based on the cell image features of 1024 drugs generated in the LINCS program. Because the multi-dimensional features of the image are affected by non-experimental factors, the characteristics of similar drugs vary greatly, and the current sample number is not enough to use deep learning and other methods are used for learning optimization. As a consequence, this study is based on the supervised ITML algorithm to convert the characteristics of drugs. The results show that the characteristics of ITML conversion are more conducive to the recognition of drug functions. The analysis of feature conversion shows that different features play important roles in identifying different drug functions. For the current COVID-19, Chloroquine and Hydroxychloroquine achieve antiviral effects by inhibiting endocytosis, etc., and were classified to the same community. And Clomiphene in the same community inibited the entry of Ebola Virus, indicated a similar MoAs that could be reflected by cell image.
What problem does this paper attempt to address?