Subgroup analysis with concave pairwise fusion penalty for ordinal response

Weirong Li,Wensheng Zhu
DOI: https://doi.org/10.1007/s00362-023-01526-w
2024-01-14
Statistical Papers
Abstract:The growing popularity of data heterogeneity motivates people to identify homogeneous subgroups with identical parameters. Meanwhile, in many fields of recent data science for some applications, such as personalized education and personalized marketing, the massive data are usually recorded as categorical or ordinal variables, which highlights the importance of performing subgroup analysis on those ordinal outcomes. In this paper, we propose a cumulative link model with subject-specific intercepts to detect and identify homogeneous subgroups through concave pairwise fusion penalty for ordinal response, where heterogeneity arises from some unknown or unobserved latent factors. The concave fusion method can simultaneously determine the number of subgroups, identify the group membership, and estimate the regression coefficients. An alternating direction method of multipliers algorithm with concave penalties for the generalized linear regression model with logit link is developed and its convergence property is studied. We also establish the oracle property of the proposed penalized estimator under some mild conditions. Our simulation studies show that the proposed method could recover the heterogeneous subgroup structure effectively when the response of interest is ordinal. Further, the advantages of our method are illustrated by the analysis on a Mathematics Student Performance Data Set of two public schools from the Alentejo region of Portugal.
statistics & probability
What problem does this paper attempt to address?