Abstract: A vast literature in statistics, biometrics, and econometrics is concerned with the analysis of binary and polychotomous response data. The classical approach fits a categorical response regression model using maximum likelihood, and inferences about the model are based on the associated asymptotic theory. The accuracy of classical confidence statements is questionable for small sample sizes. In this article, exact Bayesian methods for modeling categorical response data are developed using the idea of data augmentation. The general approach can be summarized as follows. The probit regression model for binary outcomes is seen to have an underlying normal regression structure on latent continuous data. Values of the latent data can be simulated from suitable truncated normal distributions. If the latent data are known, then the posterior distribution of the parameters can be computed using standard results for normal linear models. Draws from this posterior are used to sample new latent data, and the process is iterated with Gibbs sampling. This data augmentation approach provides a general framework for analyzing binary regression models. It leads to the same simplification achieved earlier for censored regression models. Under the proposed framework, the class of probit regression models can be enlarged by using mixtures of normal distributions to model the latent data. In this normal mixture class, one can investigate the sensitivity of the parameter estimates to the choice of “link function,” which relates the linear regression estimate to the fitted probabilities. In addition, this approach allows one to easily fit Bayesian hierarchical models. One specific model considered here reflects the belief that the vector of regression coefficients lies on a smaller dimension linear subspace. The methods can also be generalized to multinomial response models with J > 2 categories.
In the ordered multinomial model, the J categories are ordered and a model is written linking the cumulative response probabilities with the linear regression structure. In the unordered multinomial model, the latent variables have a multivariate normal distribution with unknown variance-covariance matrix. For both multinomial models, the data augmentation method combined with Gibbs sampling is outlined. This approach is especially attractive for the multivariate probit model, where calculating the likelihood can be difficult.
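The binary probit sampler summarized in the abstract alternates between two draws: latent values from truncated normal distributions, then regression coefficients from a normal linear-model posterior. The block below is a minimal sketch of that loop, not the authors' implementation. It assumes a flat prior on the coefficients and a unit latent variance, and it uses a simple inverse-CDF draw for the truncated normal that loses accuracy far in the tails. The names `probit_gibbs`, `draw_latent`, and the small linear-algebra helpers are hypothetical and chosen only for illustration.

```python
import math
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal, used for cdf and inverse cdf


def cholesky(a):
    """Return lower-triangular L with L L^T = a (a symmetric positive definite)."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = a[i][j] - sum(low[i][k] * low[j][k] for k in range(j))
            low[i][j] = math.sqrt(s) if i == j else s / low[j][j]
    return low


def solve_lower(low, b):
    """Forward substitution: solve L w = b."""
    w = []
    for i in range(len(b)):
        w.append((b[i] - sum(low[i][k] * w[k] for k in range(i))) / low[i][i])
    return w


def solve_upper_t(low, b):
    """Back substitution: solve L^T x = b."""
    n = len(b)
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (b[i] - sum(low[k][i] * x[k] for k in range(i + 1, n))) / low[i][i]
    return x


def draw_latent(mean, y, rng):
    """Draw z ~ N(mean, 1) truncated to z > 0 if y == 1, else to z <= 0."""
    cut = STD_NORMAL.cdf(-mean)            # P(z <= 0) before truncation
    lo, hi = (cut, 1.0) if y == 1 else (0.0, cut)
    u = lo + (hi - lo) * rng.random()
    u = min(max(u, 1e-12), 1.0 - 1e-12)    # keep inv_cdf inside (0, 1)
    return mean + STD_NORMAL.inv_cdf(u)


def probit_gibbs(X, y, n_iter=2000, burn_in=500, seed=0):
    """Data-augmentation Gibbs sampler for probit regression (flat prior assumed)."""
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    xtx = [[sum(X[i][a] * X[i][b] for i in range(n)) for b in range(p)]
           for a in range(p)]
    low = cholesky(xtx)                    # X'X = L L^T, computed once
    beta = [0.0] * p
    draws = []
    for it in range(n_iter):
        # Step 1: latent data given the current coefficients and the outcomes.
        z = [draw_latent(sum(xj * bj for xj, bj in zip(X[i], beta)), y[i], rng)
             for i in range(n)]
        # Step 2: beta | z ~ N((X'X)^{-1} X'z, (X'X)^{-1}).
        xtz = [sum(X[i][a] * z[i] for i in range(n)) for a in range(p)]
        w = solve_lower(low, xtz)
        noise = [rng.gauss(0.0, 1.0) for _ in range(p)]
        # L^{-T}(w + noise) equals the least-squares mean plus N(0, (X'X)^{-1}) noise.
        beta = solve_upper_t(low, [wi + ei for wi, ei in zip(w, noise)])
        if it >= burn_in:
            draws.append(beta)
    return draws
```

Calling `probit_gibbs(X, y)` with `X` as a list of covariate rows (including a leading 1.0 for an intercept) and `y` as a list of 0/1 outcomes returns the retained coefficient draws; averaging them column by column gives a posterior mean estimate. Because the design matrix is fixed, the Cholesky factor of X'X is computed once and reused in every iteration.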

Naive Bayes Classifiers and One-hot Encoding of Categorical Variables

One generalization of the naive bayes to fuzzy sets and the design of the fuzzy naive bayes classifier

Fuzzy Naive Bayes Classifier Based on Fuzzy Clustering

A Naïve Bayesian Classifier in Categorical Uncertain Data Streams

On the Optimality of the Simple Bayesian Classifier under Zero-One Loss

Fairness Implications of Encoding Protected Categorical Attributes

Bayesian-Estimation Methods for Categorical-Data with Misclassifications

Probabilistic learning of boolean functions applied to the binary classification problem with categorical covariates

Encoding high-cardinality string categorical variables

Naïve Bayes Classification in R

Reasoning about Bayesian Network Classifiers

Comparative Study on the Performance of Categorical Variable Encoders in Classification and Regression Tasks

Out of (the) bag—encoding categorical predictors impacts out-of-bag samples

Finding The Optimal Feature Representations For Bayesian Network Learning

What Are the Differences Between Bayesian Classifiers and Mutual-Information Classifiers?

Structure of Classifier Boundaries: Case Study for a Naive Bayes Classifier

Sufficient Representations for Categorical Variables

Optimal partition of feature using Bayesian classifier

Bayesian Inference for Categorical Data with Misclassification Errors

Bayesian Analysis of Binary and Polychotomous Response Data

A model-free Bayesian classifier