Addressing statistical challenges in the analysis of proteomics data with extremely small sample size: a simulation study

Kyung Hyun Lee, Shervin Assassi, Chandra Mohan, Claudia Pedroza
DOI: https://doi.org/10.1186/s12864-024-11018-2
IF: 4.547
2024-11-18
BMC Genomics
Abstract: One of the most promising approaches for early and more precise disease prediction and diagnosis is through the inclusion of proteomics data augmented with clinical data. Clinical proteomics data is often characterized by its high dimensionality and extremely limited sample size, posing a significant challenge when employing machine learning techniques for extracting only the most relevant information. Although there is a wide array of statistical techniques and numerous analysis pipelines employed in proteomics data analysis, it is unclear which of these methods produce the most efficient, reproducible, and clinically meaningful results.
genetics & heredity, biotechnology & applied microbiology