Abstract:A general random effects model is proposed that allows for continuous as well as discrete distributions of the responses. Responses can be unrestricted continuous, bounded continuous, binary, ordered categorical or given in the form of counts. The distribution of the responses is not restricted to exponential families, which is a severe restriction in generalized mixed models. Generalized mixed models use fixed distributions for responses, for example the Poisson distribution in count data, which has the disadvantage of not accounting for overdispersion. By using a response function and a thresholds function the proposed mixed thresholds model can account for a variety of alternative distributions that often show better fits than fixed distributions used within the generalized linear model framework. A particular strength of the model is that it provides a tool for joint modeling, responses may be of different types, some can be discrete, others continuous. In addition to introducing the mixed thresholds model parameter sparsity is addressed. Random effects models can contain a large number of parameters, in particular if effects have to be assumed as measurement-specific. Methods to obtain sparser representations are proposed and illustrated. The methods are shown to work in the thresholds model but could also be adapted to other modeling approaches.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the flexibility and adaptability issues when dealing with different types of response variables (such as binary, ordinal categorical, count - type, and continuous response variables) in the random - effects model. Specifically, the paper proposes a general random - effects model framework, which can handle not only response variables with continuous distributions but also those with discrete distributions. The key to this framework lies in that it is not limited to the exponential family distribution, but instead flexibly adapts to various distribution situations by using response functions and threshold functions, thus overcoming the limitations of the fixed - distribution assumption in the generalized linear model.
### Main problems solved by the paper:
1. **Handling multiple types of response variables**: The Mixed Thresholds Model (MTM) proposed in the paper can handle different types of data simultaneously, including continuous, binary, ordinal categorical, and count data. This model allows joint modeling of different types of data under a unified framework, thereby providing more comprehensive data analysis capabilities.
2. **Improving model flexibility**: Traditional Generalized Linear Mixed Models (GLMM) usually assume that the response variable follows a specific distribution (such as the Poisson distribution), which has limitations when dealing with over - dispersed data. By introducing response functions and threshold functions, the paper enables the model to adapt to more diverse distributions, thereby improving the model's flexibility and fitting effect.
3. **Reducing the number of parameters**: The number of parameters in the random - effects model can be very large, especially when measurement - specific effects need to be considered. The paper proposes several methods to obtain a sparser representation, thereby reducing the complexity and number of parameters of the model, and improving the interpretability and computational efficiency of the model.
### Core features of the model:
- **Response functions and threshold functions**: Through the response function \( F(\cdot) \) and the threshold function \( \delta_j(\cdot) \), the model can flexibly adapt to different types of response variables. For example, for continuous response variables, linear or nonlinear threshold functions can be used; for discrete response variables, shifted log - threshold functions can be used, etc.
- **Joint modeling**: The model allows handling different types of response variables under the same framework, such as a combination of continuous variables and ordinal categorical variables, thereby providing more comprehensive data analysis capabilities.
- **Sparse representation**: By introducing sparse representation methods, the number of parameters in the model is reduced, improving the computational efficiency and interpretability of the model.
### Application examples:
- **Rent data**: Using Munich rent index data, it shows how to use the Gumbel distribution to better fit right - skewed monthly rent data.
- **Sleep deprivation study**: Using the sleep deprivation data set, it shows how to use the log - threshold function to handle positive - value variables (such as reaction time).
- **Epileptic seizure data**: Using the epileptic seizure data set, it shows how to use the Gumbel distribution to handle count data and compares it with the Poisson model and the negative binomial model.
- **Political fear data**: Using data from the German Longitudinal Election Study, it shows how to handle ordinal categorical data and analyzes the influence of different covariates on the fear level.
In conclusion, this paper proposes a general and flexible random - effects model framework, aiming to solve the limitations of existing models when dealing with multiple types of response variables and improve the adaptability and fitting effect of the model.