Double-mixing semiparametric logistic regression with unknown sizes

Wei Zhang
DOI: https://doi.org/10.48550/arXiv.math/0609581
2006-09-21
Abstract: Binomial data with unknown sizes often appear in the biological and medical sciences and are usually overdispersed. All previous methods used parametric models and considered only overdispersion due to variation in the sizes. The proposed semiparametric model accounts for overdispersion due to variation in both the sizes and the probabilities. In doing so, it can accommodate variation caused by observations, missing covariates, and random measurement errors in covariates. An Expectation Conditional Maximization (ECM) algorithm is provided to stabilize the optimization of the log-likelihood. The selection of the number of support points of the mixing distributions and bootstrap methods are also discussed. A simulation study evaluates the performance of the proposed model, and two real examples illustrate it.
Statistics Theory