Terahertz Spectroscopy and Machine Learning Algorithm for Non-Destructive Evaluation of Protein Conformation

Can Cao, Zhaohui Zhang, Xiaoyan Zhao, Tianyao Zhang
DOI: https://doi.org/10.1007/s11082-020-02345-1
IF: 3
2020-01-01
Optical and Quantum Electronics
Abstract: Because protein conformation and activity are highly susceptible to environmental factors such as temperature and pH, methods for evaluating protein conformation and activity are urgently needed in many fields. For example, most protein drugs need a stable and suitable environment during production, storage and transportation, and maintaining protein activity throughout the whole process is an enormous challenge. It is therefore necessary to ensure the safety and effectiveness of protein drugs by monitoring their activity before use. In this study, we present an improved method for non-destructive evaluation of protein conformation and biological activity that combines terahertz spectroscopy with t-SNE-XGBoost. First, bovine serum albumin (BSA) samples heated to different temperatures were measured with terahertz time-domain spectroscopy (THz-TDS). The results indicate that native-conformation BSA passes through transient states during temperature-induced denaturation. For any single sample, however, it is difficult to identify conformation and activity directly from the measured raw terahertz data. We therefore applied several different algorithms to the raw data to recognize BSA samples whose conformation and activity differ as a result of temperature. Finally, the models obtained with the different algorithms were evaluated by calculating the root mean square error of prediction (RMSEP) and the correlation coefficient of prediction ($$R_p$$). THz-TDS combined with t-SNE-XGBoost proved to be an effective non-destructive and label-free method for evaluating protein conformation and activity. It offers a new technique for many applications, such as the pharmaceutical industry, clinical diagnosis and quality control.
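The two evaluation metrics named in the abstract can be illustrated with a minimal sketch. The abstract does not give the formulas, so this assumes the conventional definitions: RMSEP as the square root of the mean squared difference between predicted and reference values on the prediction set, and $$R_p$$ as the Pearson correlation between them. The function names and the sample values below are hypothetical and are not taken from the paper.

```python
import math


def rmsep(reference, predicted):
    """Root mean square error of prediction (assumed conventional definition)."""
    if len(reference) != len(predicted) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(p - r) ** 2 for r, p in zip(reference, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def r_p(reference, predicted):
    """Pearson correlation between reference and predicted values (assumed definition)."""
    n = len(reference)
    if n != len(predicted) or n < 2:
        raise ValueError("inputs must have equal length of at least 2")
    mean_r = sum(reference) / n
    mean_p = sum(predicted) / n
    cov = sum((r - mean_r) * (p - mean_p) for r, p in zip(reference, predicted))
    var_r = sum((r - mean_r) ** 2 for r in reference)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    if var_r == 0 or var_p == 0:
        raise ValueError("correlation is undefined for constant inputs")
    return cov / math.sqrt(var_r * var_p)


# Hypothetical prediction-set values, for illustration only.
reference = [25.0, 40.0, 55.0, 70.0, 85.0]
predicted = [27.0, 38.0, 56.0, 71.0, 83.0]

print("RMSEP:", rmsep(reference, predicted))
print("R_p:", r_p(reference, predicted))
```

Under these definitions, a lower RMSEP and an $$R_p$$ closer to 1 indicate a better model, which is how such metrics are typically used to compare algorithms on the same prediction set.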