Adaptative clustering by minimization of the mixing entropy criterion

Thierry Dumont
DOI: https://doi.org/10.48550/arXiv.2203.11517
2022-03-22
Abstract:We present a clustering method and provide a theoretical analysis and an explanation to a phenomenon encountered in the applied statistical literature since the 1990's. This phenomenon is the natural adaptability of the order when using a clustering method derived from the famous EM algorithm. We define a new statistic, the relative entropic order, that represents the number of clumps in the target distribution. We prove in particular that the empirical version of this relative entropic order is consistent. Our approach is easy to implement and has a high potential of applications. Perspectives of this works are algorithmic and theoretical, with possible natural extensions to various cases such as dependent or multidimensional data.
Statistics Theory,Methodology,Machine Learning
What problem does this paper attempt to address?