Cisplatin resistance-related multi-omics differences and the establishment of machine learning models

Qihai Sui,Zhencong Chen,Zhengyang Hu,Yiwei Huang,Jiaqi Liang,Guoshu Bi,Yunyi Bian,Mengnan Zhao,Cheng Zhan,Zongwu Lin,Qun Wang,Lijie Tan
DOI: https://doi.org/10.1186/s12967-022-03372-0
2022-04-11
Abstract:Objectives: Platinum-based chemotherapies are currently the first-line treatment of non-small cell lung cancer. This study will improve our understanding of the causes of resistance to cisplatin, especially in lung adenocarcinoma (LUAD) and provide a reference for therapeutic decisions in clinical practice. Methods: Cancer Cell Line Encyclopedia (CCLE), The Cancer Genome Atlas (TCGA) and Zhongshan hospital affiliated to Fudan University (zs-cohort) were used to identify the multi-omics differences related to platinum chemotherapy. Cisplatin-resistant mRNA and miRNA models were constructed by Logistic regression, classification and regression tree and C4.5 decision tree classification algorithm with previous feature selection performed via least absolute shrinkage and selection operator (LASSO). qRT-PCR and western-blotting of A549 and H358 cells, as well as single-cell Seq data of tumor samples were applied to verify the tendency of certain genes. Results: 661 cell lines were divided into three groups according to the IC50 value of cisplatin, and the top 1/3 (220) with a small IC50 value were defined as the sensitive group while the last 1/3 (220) were enrolled in the insensitive group. TP53 was the most common mutation in the insensitive group, in contrast to TTN in the sensitive group. 1348 mRNA, 80 miRNA, and 15 metabolites were differentially expressed between 2 groups (P < 0.05). According to the LASSO penalized logistic modeling, 6 of the 1348 mRNAs, FOXA2, BATF3, SIX1, HOXA1, ZBTB38, IRF5, were selected as the associated features with cisplatin resistance and for the contribution of predictive mRNA model (all of adjusted P-values < 0.001). Three of 6 (BATF3, IRF5, ZBTB38) genes were finally verified in cell level and patients in zs-cohort. Conclusions: Somatic mutations, mRNA expressions, miRNA expressions, metabolites and methylation were related to the resistance of cisplatin. The models we created could help in the prediction of the reaction and prognosis of patients given platinum-based chemotherapies.
What problem does this paper attempt to address?