An enriched approach to combining high-dimensional genomic and low-dimensional phenotypic data

Javier Cabrera, Birol Emir, Ge Cheng, Yajie Duan, Demissie Alemayehu, Yauheniya Cherkas
DOI: https://doi.org/10.1080/10543406.2024.2330203
2024-04-07
Journal of Biopharmaceutical Statistics
Abstract: We describe an approach for combining and analyzing high-dimensional genomic and low-dimensional phenotypic data. The approach applies a scheme of weights to the variables rather than to the observations and thereby permits incorporation of the information provided by the low-dimensional data source. It can also be incorporated into commonly used downstream techniques, such as random forest or penalized regression. Finally, simulated lupus studies involving genetic and clinical data are used to illustrate the overall idea and show that the proposed enriched penalized method can select significant genetic variables while keeping several important clinical variables in the final model.
pharmacology & pharmacy, statistics & probability
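
The abstract does not spell out the weighting scheme, so the following is only a minimal sketch of the general idea of variable-level weights in a penalized regression, not the authors' method. It assumes a lasso penalty with one penalty weight per variable, where clinical variables get weight 0 (never penalized, hence always kept) and genomic variables get weight 1. The function names, the toy data, the weights, and the value of `lam` are all hypothetical choices made for illustration.

```python
import random


def soft_threshold(value, threshold):
    """Shrink value toward zero by threshold (the lasso proximal step)."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def weighted_lasso(X, y, penalty_weights, lam, n_iter=100):
    """Coordinate descent for (1/2n)||y - Xb||^2 + lam * sum_j w_j * |b_j|.

    No intercept is fitted; the toy data below is roughly centered.
    """
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    residual = list(y)  # equals y - X @ beta, with beta = 0 at the start
    col_scale = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_scale[j] == 0.0:
                continue
            # Covariance of column j with the residual that excludes beta_j.
            rho = sum(X[i][j] * residual[i] for i in range(n)) / n
            rho += col_scale[j] * beta[j]
            # A weight of 0 means no shrinkage for this variable.
            new_beta = soft_threshold(rho, lam * penalty_weights[j]) / col_scale[j]
            delta = new_beta - beta[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                beta[j] = new_beta
    return beta


# Hypothetical toy data: 2 clinical columns followed by 20 genomic columns.
random.seed(0)
n, n_clinical, n_genomic = 100, 2, 20
X = [[random.gauss(0, 1) for _ in range(n_clinical + n_genomic)] for _ in range(n)]
# Only the two clinical columns and the first genomic column drive the outcome.
y = [1.0 * row[0] + 0.5 * row[1] + 2.0 * row[2] + random.gauss(0, 0.5) for row in X]

# Assumed weights: clinical variables unpenalized, genomic variables penalized.
penalty_weights = [0.0] * n_clinical + [1.0] * n_genomic
beta = weighted_lasso(X, y, penalty_weights, lam=0.3)

selected_genomic = [j for j in range(n_clinical, len(beta)) if beta[j] != 0.0]
print("clinical coefficients:", beta[:n_clinical])
print("selected genomic columns:", selected_genomic)
```

Because the weights act on variables rather than observations, the same vector could in principle be derived from the low-dimensional data source instead of being fixed at 0 and 1 as assumed here.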