Optimal dichotomization of bimodal Gaussian mixtures

Yan-ni Jhan,Wan-cen Li,Shin-hui Ruan,Jia-jyun Sie,Iebin Lian
DOI: https://doi.org/10.1007/s00362-023-01521-1
2024-01-05
Statistical Papers
Abstract:Despite criticism for loss of information and power, dichotomization of variables is still frequently used in social, behavioral, and medical sciences, mainly because it yields more interpretable conclusions for research outcomes and is useful for decision making. However, the artificial choice of cut-points can be controversial and needs proper justification. In this work, we investigate the properties of point-biserial correlation after dichotomization with underlying bimodal Gaussian mixture distributions. We propose a dichotomous grouping procedure that considers the largest standardized difference in group mean while minimizing information loss.
statistics & probability
What problem does this paper attempt to address?