Integrative Data Analysis Where Partial Covariates Have Complex Nonlinear Effects by Using Summary Information from an External Data

Jia Liang, Shuo Chen, Peter Kochunov, L. Elliot Hong, Chixiang Chen
DOI: https://doi.org/10.1080/00031305.2024.2368799
2024-08-28
The American Statistician
Abstract: A full parametric and linear specification may be insufficient to capture complicated patterns in studies exploring complex features, such as those investigating age-related changes in brain functional abilities. Alternatively, a partially linear model (PLM) consisting of both parametric and nonparametric elements may have a better fit. This model has been widely applied in economics, environmental science, and biomedical studies. In this article, we introduce a novel statistical inference framework that equips PLM with high estimation efficiency by effectively synthesizing summary information from external data into the main analysis. Such an integrative scheme is versatile in assimilating various types of reduced models from the external study. The proposed method is shown to be theoretically valid and numerically convenient, and it ensures a high-efficiency gain compared to classic methods in PLM. Our method is further validated using two data applications by evaluating the risk factors of brain imaging measures and blood pressure.
statistics & probability