Statistical Learning Guarantees for Compressive Clustering and Compressive Mixture Modeling

Rémi Gribonval, Gilles Blanchard, Nicolas Keriven, Yann Traonmilin
DOI: https://doi.org/10.48550/arXiv.2004.08085
2021-08-17
Abstract: We provide statistical learning guarantees for two unsupervised learning tasks in the context of compressive statistical learning, a general framework for resource-efficient large-scale learning that we introduced in a companion paper. The principle of compressive statistical learning is to compress a training collection, in one pass, into a low-dimensional sketch (a vector of random empirical generalized moments) that captures the information relevant to the considered learning task. We explicitly describe and analyze random feature functions whose empirical averages preserve the needed information for compressive clustering and compressive Gaussian mixture modeling with fixed known variance, and establish sufficient sketch sizes given the problem dimensions.
Machine Learning, Information Theory, Statistics Theory
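
As an illustration of the one-pass sketching principle described in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes the random feature functions are complex exponentials (random Fourier features) with Gaussian frequencies; the names `compute_sketch`, `omegas`, and the sizes `d` and `m` are hypothetical and chosen only for this example.

```python
import numpy as np

def compute_sketch(batches, omegas):
    """Average the features exp(i * <omega_j, x>) over all samples in one pass.

    batches: iterable of arrays of shape (n_b, d), read once
    omegas:  array of shape (m, d), assumed random frequencies
    """
    total = np.zeros(omegas.shape[0], dtype=complex)
    count = 0
    for batch in batches:
        # Accumulate feature sums; the raw batch can be discarded afterwards.
        total += np.exp(1j * batch @ omegas.T).sum(axis=0)
        count += batch.shape[0]
    return total / count

rng = np.random.default_rng(0)
d, m = 2, 50                           # data dimension and sketch size (assumed)
omegas = rng.normal(size=(m, d))       # assumed Gaussian frequency distribution
data = rng.normal(size=(1000, d))      # synthetic training collection
sketch = compute_sketch(np.array_split(data, 10), omegas)
```

The sketch has a fixed size `m` regardless of the number of samples, and because it is an average it can be accumulated over a stream or merged across machines. How large `m` must be for a given task is what the paper's guarantees address.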