A Splicing Approach to Best Subset of Groups Selection

Yanhang Zhang,Junxian Zhu,Jin Zhu,Xueqin Wang
DOI: https://doi.org/10.1287/ijoc.2022.1241
IF: 3.288
2023-01-01
INFORMS Journal on Computing
Abstract:Best subset of groups selection (BSGS) is the process of selecting a small part of nonoverlapping groups to achieve the best interpretability on the response variable. It has attracted increasing attention and has far-reaching applications in practice. However, due to the computational intractability of BSGS in high-dimensional settings, developing efficient algorithms for solving BSGS remains a research hotspot. In this paper, we propose a group -splicing algorithm that iteratively detects the relevant groups and excludes the irrelevant ones. Moreover, coupled with a novel group information criterion, we develop an adaptive algorithm to determine the optimal model size. Under certain conditions, it is certifiable that our algorithm can identify the optimal subset of groups in polynomial time with high probability. Finally, we demonstrate the efficiency and accuracy of our methods by compar-ing them with several state-of-the-art algorithms on both synthetic and real-world data sets.
What problem does this paper attempt to address?