Feature Interaction Based Online Streaming Feature Selection Via Buffer Mechanism.

Yan Lv,Yaojin Lin,Xiangyan Chen,Chenxi Wang,Shaozi Li
DOI: https://doi.org/10.1002/cpe.6435
2021-01-01
Abstract:Feature selection is a nontrivial preprocessing technique in many practical application domains. There are three key challenges with respect to real-world data. Firstly, the dimensionality of data keeps growing and will achieve hundreds of millions. Secondly, the data has the characteristic of high-dimensional and small-size. Thirdly, practical applications need to process each feature in an online manner. However, most of the previous methods only pay much attention to solving the challenges of high dimensionality and online stream. To address all issues above, we propose OFSI, in this article, an O_nline streaming F_eature S_election based on feature I_nteraction method for feature selection. OFSI can effectively select the streaming features that are strongly related to each other in high-dimensional and small-size data, via using feature interaction. Furthermore, to address upcoming features that arrive by groups, we present a new group-OFSI algorithm for online group feature selection. An extensive experiment using a series of benchmark data sets shows that the proposed two algorithms, OFSI and group-OFSI, outperform six state-of-the-art online streaming feature selection methods.
What problem does this paper attempt to address?