Multi-omics profiling with untargeted proteomics for blood-based early detection of lung cancer

Brian Koh,Manway Liu,Rebecca Almonte,Daniel Ariad,Ghristine Bundalian,Jessica Chan,Jinlyung Choi,Wan-Fang Chou,Rea Cuaresma,Esthelle Hoedt,Lexie Hopper,Yuntao Hu,Anisha Jain,Ehdieh Khaledian,Thidar Khin,Ajinkya Kokate,Joon-Yong Lee,Stephanie Leung,Chi-Hung Lin,Mark Marispini,Hoda Malekpour,Megan Mora,Nithya Mudaliar,Sara Nouri Golmaei,Hao Qian,Madhuvanthi Ramaiah,Saividya Ramaswamy,Purva Ranjan,Guanhua Shu,Peter Spiro,Benjamin Ta,Dijana Vitko,Jacob Waiss,Zachary Yanagihara,Robert Zawada,Jimmy Yi Zeng,Susan Zhang,James Yee,John E. Blume,Chinmay Belthangady,Bruce Wilcox,Philip Ma
DOI: https://doi.org/10.1101/2024.01.03.24300798
2024-01-04
Abstract:Blood-based approaches to detect early-stage cancer provide an opportunity to improve survival rates for lung cancer, the most lethal cancer world-wide. Multiple approaches for blood-based cancer detection using molecular analytes derived from individual ‘omics (cell-free DNA, RNA transcripts, proteins, metabolites) have been developed and tested, generally showing significantly lower sensitivity for early-stage versus late-stage cancer. We hypothesized that an approach using multiple types of molecular analytes, including broad and untargeted coverage of proteins, could identify biomarkers that more directly reveal changes in gene expression and molecular phenotype in response to carcinogenesis to potentially improve detection of early-stage lung cancer. To that end, we designed and conducted one of the largest multi-omics, observational studies to date, enrolling 2513 case and control subjects. Multi-omics profiling detected 113,671 peptides corresponding to 8385 protein groups, 219,729 RNA transcripts, 71,756 RNA introns, and 1801 metabolites across all subject samples. We then developed a machine learning-based classifier for lung cancer detection comprising 682 of these multi-omics analytes. This multi-omics classifier demonstrated 89%, 80%, and 98-100% sensitivity for all-stage, stage I, and stage III-IV lung cancer, respectively, at 89% specificity in a validation set. The application of a multi-omics platform for discovery of blood-based disease biomarkers, including proteins and complementary molecular analytes, enables the noninvasive detection of early-stage lung cancer with the potential for downstaging at initial diagnosis and the improvement of clinical outcomes.
Oncology
What problem does this paper attempt to address?
The paper aims to address the issue of early detection of lung cancer. Specifically, the researchers hypothesize that identifying biomarkers through multi-omics approaches (including extensive untargeted proteomics) can more effectively reveal gene expression and molecular phenotype changes caused by cancer, thereby improving the sensitivity of early lung cancer detection. To test this hypothesis, they designed and implemented a large-scale multi-omics observational study, including 2513 cases and control subjects, and used this data to develop a machine learning-based lung cancer detection classifier. The results showed that this multi-omics classifier achieved detection sensitivities of 89%, 80%, and 98-100% for all stages, stage I, and stages III-IV lung cancer in the validation set, respectively, with a specificity of 89%. This indicates that applying multi-omics platforms to discover disease biomarkers in blood, including proteins and other complementary molecular analytes, can enable non-invasive early detection of lung cancer, with potential clinical application value.