High-Throughput Measurement and Machine Learning-Based Prediction of Collision Cross Sections for Drugs and Drug Metabolites

Dylan H. Ross,Ryan P. Seguin,Allison M. Krinsky,Libin Xu
DOI: https://doi.org/10.1101/2021.05.13.443945
2021-05-15
Abstract:Drug metabolite identification is a bottleneck of drug metabolism studies. Ion mobility-mass spectrometry (IM-MS) enables the measurement of collision cross section (CCS), a unique physical property related to an ion's gas-phase size and shape, which can be used to increase the confidence in the identification of unknowns. A current limitation to the application of IM-MS to the identification of drug metabolites is the lack of reference CCS values. In this work, we present the production of a large-scale database of drug and drug metabolite CCS values, assembled using high-throughput in vitro drug metabolite generation and a rapid IM-MS analysis with automated data processing. Subsequently, we used this database to train a machine learning-based CCS prediction model, employing a combination of conventional 2D molecular descriptors and novel 3D descriptors. This novel prediction model enables the prediction of different CCS values for different protomers, conformers, and positional isomers for the first time.
What problem does this paper attempt to address?