Abstract: Feature selection is well established as a way to reduce dimensionality and improve the performance of learning algorithms. Traditional feature selection algorithms, however, often struggle to capture non-linear relationships between features and responses. Deep neural networks capture such non-linearities well, but their "black-box" nature limits interpretability, and complex deep architectures can lead to long training times and vanishing gradients. This study aims to simplify network structure, speed up training, and improve model interpretability without sacrificing accuracy. This paper presents a sparse-weighted feature selection approach based on convolutional neural networks, termed the low-dimensional sparse-weighted feature selection network (LSWFSNet). LSWFSNet inserts a convolutional selection kernel between the input and the convolutional layers; the kernel performs a weighted convolution on the input data and is subject to sparse constraints. Features with large weights in this kernel are retained for the subsequent computations in LSWFSNet, while features with negligible weights are discarded to reduce model complexity. By reducing the network's input data, LSWFSNet also simplifies the feature maps produced after convolution and thus the network structure. To account for the intrinsic interconnections within the data, the study combines several sparse constraints into a single objective function, which enforces sparsity of the convolutional selection kernel while respecting the structure of the data. Notably, the base convolutional network in this method can be replaced with any deep convolutional network, provided the convolutional selection kernel is adjusted to the dimensions of the input data.
The LSWFSNet model was tested on human emotion electroencephalography (EEG) datasets curated by Shanghai Jiao Tong University. Under the various sparse constraint methods employed, the convolutional selection kernel became sparse. Regions of the kernel with non-zero weights were identified as strongly correlated with emotional responses. The empirical results are consistent with existing neuroscience findings, and LSWFSNet exceeds the baseline network in accuracy. LSWFSNet also extends to keypoint recognition tasks, such as extracting salient pixels in facial detection models or isolating target attributes in object detection frameworks. The significance of this study lies in combining sparse constraint techniques with deep convolutional networks in place of traditional fully connected networks, which improves model interpretability and broadens applicability, notably in image processing.
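
The selection mechanism described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes the selection kernel acts as an element-wise weight on the input, that the combined objective adds an L1 term and a per-row group term to the task loss, and that sparsity is produced by soft-thresholding (a standard proximal step for the L1 term only, swapped in here for illustration). All function names, shapes, and hyperparameters are hypothetical.

```python
import numpy as np

def apply_selection_kernel(x, w):
    """Weight every input feature by its selection-kernel entry (element-wise)."""
    return x * w  # w broadcasts over the leading batch axis

def sparse_penalty(w, lam_l1=0.01, lam_group=0.01):
    """Illustrative combined penalty: L1 on entries plus one group term per row."""
    l1 = np.abs(w).sum()
    group = np.sqrt((w ** 2).sum(axis=1)).sum()  # each row (e.g. a channel) is a group
    return lam_l1 * l1 + lam_group * group

def soft_threshold(w, t):
    """Proximal step for the L1 term: shrink weights, set small ones to exactly zero."""
    return np.sign(w) * np.maximum(np.abs(w) - t, 0.0)

def selected_features(w, tol=0.0):
    """(row, column) indices of kernel entries that survive sparsification."""
    return [tuple(int(i) for i in idx) for idx in np.argwhere(np.abs(w) > tol)]

# Toy kernel: 3 channels x 2 frequency bands (hypothetical shape and values).
w = np.array([[0.90, 0.02],
              [0.01, 0.03],
              [0.00, 0.70]])
x = np.ones((4, 3, 2))                  # batch of 4 inputs
w_sparse = soft_threshold(w, 0.05)      # small weights become exactly zero
x_selected = apply_selection_kernel(x, w_sparse)
kept = selected_features(w_sparse)
print(kept)  # [(0, 0), (2, 1)]
```

In this toy case only two of the six input features keep a non-zero weight, so only those two would be passed on to the convolutional layers. The surviving indices play the role of the kernel regions that the abstract associates with emotional responses.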
