Abstract: Sufficient dimension reduction (SDR) is a popular class of regression methods that aim to find a small number of linear combinations of covariates capturing all the information about the responses, i.e., a central subspace. The majority of current SDR methods focus on the setting of independent observations, while the few techniques developed for clustered data assume the linear transformation is identical across clusters. In this article, we introduce random effects SDR, where cluster-specific random effect central subspaces are assumed to follow a distribution on the Grassmann manifold, and the random effects distribution is characterized by a covariance matrix that captures the heterogeneity between clusters in the SDR process itself. We incorporate random effects SDR within a model-based inverse regression framework. Specifically, we propose a random effects principal fitted components model, where a two-stage algorithm is used to estimate the overall fixed effect central subspace and predict the cluster-specific random effect central subspaces. We establish the consistency of the proposed estimators, and simulation studies demonstrate the superior performance of the proposed approach compared to global and cluster-specific SDR approaches. We also extend the model to handle mixed predictors, showing how random effects SDR can be achieved with mixed continuous and binary covariates. Applying the proposed methods to study the longitudinal association between the life expectancy of women and socioeconomic variables across 117 countries, we find that log income per capita, infant mortality, and income inequality are the main drivers of a two-dimensional fixed effect central subspace, although there is considerable heterogeneity in how the country-specific central subspaces are driven by the predictors.

Random effects model-based sufficient dimension reduction for independent clustered data

Adjusting Inverse Regression for Predictors with Clustered Distribution

Dimension Reduction Estimation for Central Mean Subspace with Missing Multivariate Response

Learning Heterogeneity in Causal Inference Using Sufficient Dimension Reduction

An Estimating Equation Approach to Dimension Reduction for Longitudinal Data

Testing the Linear Mean and Constant Variance Conditions in Sufficient Dimension Reduction

New forest-based approaches for sufficient dimension reduction

A Principal Square Response Forward Regression Method for Dimension Reduction

Extending the Scope of Inverse Regression Methods in Sufficient Dimension Reduction

Asymptotic results for nonparametric regression estimators after sufficient dimension reduction estimation

On Estimating Regression-Based Causal Effects Using Sufficient Dimension Reduction

A new sufficient dimension reduction method via rank divergence

Estimating average treatment effect on the treated via sufficient dimension reduction

A selective review of sufficient dimension reduction for multivariate response regression

Forest-based Approaches for SDR

Robust sufficient dimension reduction via α-distance covariance

Missing Data Analysis with Sufficient Dimension Reduction

A Note on Sliced Inverse Regression with Missing Predictors

Impact of Sufficient Dimension Reduction in Nonparametric Estimation of Causal Effect

Functional sufficient dimension reduction through distance covariance

A robust and efficient approach to causal inference based on sparse sufficient dimension reduction