An Information-Theoretic Approach to Universal Feature Selection in High-Dimensional Inference.

Shao-Lun Huang,Anuran Makur,Gregory W. Wornell,Lizhong Zheng
DOI: https://doi.org/10.1109/isit.2017.8006746
2024-01-01
Foundations and Trends® in Communications and Information Theory
Abstract:We develop an information theoretic framework for addressing feature selection in applications where the inference task is not specified in advance and the data is from a large alphabet. We introduce a natural notion of universality for such problems, and show that locally optimal solutions are straightforward to obtain, admit natural interpretations via information geometry, have computationally efficient implementations, and represent a practically useful learning methodology. Our development also reveals the key role of Hirschfeld-Gebelein-Renyi maximal correlation and the alternating conditional expectations (ACE) algorithm in such problems.
What problem does this paper attempt to address?