A Latent Factor Model for High-Dimensional Binary Data

Jiaxin Shi, Yuan Gao, Rui Pan, Hansheng Wang
DOI: https://doi.org/10.48550/arxiv.2404.08457
2024-01-01
Abstract: In this study, we develop a latent factor model for analysing high-dimensional binary data. Specifically, a standard probit model is used to describe the regression relationship between the observed binary data and the continuous latent variables. Our method assumes that the dependency structure of the observed binary data can be fully captured by the continuous latent factors. To estimate the model, a moment-based estimation method is developed. The proposed method is able to deal with both discontinuity and high dimensionality. Most importantly, the asymptotic properties of the resulting estimators are rigorously established. Extensive simulation studies are presented to demonstrate the proposed methodology. A real dataset about product descriptions is analysed for illustration.
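As a rough illustration of the kind of model the abstract describes, the sketch below simulates binary data whose dependence comes entirely from a few continuous latent factors through a probit link. It covers only a data-generating process, not the moment-based estimator developed in the paper. The function name, the standard normal factors, the uniform loadings, and the absence of an intercept are all assumptions made here for illustration and are not taken from the paper.

```python
import numpy as np


def simulate_probit_factor_data(n, p, d, seed=0):
    """Simulate an n x p binary matrix from a probit latent factor model.

    Hypothetical setup (assumed for illustration, not from the paper):
    factors are standard normal, loadings are uniform on [-1, 1],
    and there is no intercept term.
    """
    rng = np.random.default_rng(seed)
    Z = rng.standard_normal((n, d))          # latent factors, one row per observation
    B = rng.uniform(-1.0, 1.0, size=(p, d))  # factor loadings, one row per binary variable
    E = rng.standard_normal((n, p))          # independent standard normal noise (probit link)
    # Y_ij = 1 if b_j' z_i + e_ij > 0, so P(Y_ij = 1 | z_i) = Phi(b_j' z_i).
    Y = (Z @ B.T + E > 0).astype(int)
    return Y, Z, B


if __name__ == "__main__":
    Y, Z, B = simulate_probit_factor_data(n=500, p=200, d=3)
    print(Y.shape)
    # Columns of Y are correlated only through the shared factors Z.
    print(np.corrcoef(Y[:, 0], Y[:, 1])[0, 1])
```

Conditional on the factors, the binary entries in this sketch are independent, which is one simple way to read the assumption that the latent factors fully capture the dependency structure.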