AltFS: Agency-light Feature Selection with Large Language Models in Deep Recommender Systems

Pengyue Jia,Zhaocheng Du,Yichao Wang,Xiangyu Zhao,Xiaopeng Li,Yuhao Wang,Qidong Liu,Huifeng Guo,Ruiming Tang
2024-12-12
Abstract:Feature selection is crucial in recommender systems for improving model efficiency and predictive performance. Traditional methods rely on agency models, such as decision trees or neural networks, to estimate feature importance. However, this approach is inherently limited, as the agency models may fail to learn effectively in all scenarios due to suboptimal training conditions (e.g., feature collinearity, high-dimensional sparsity, and data insufficiency). In this paper, we propose AltFS, an Agency-light Feature Selection method for deep recommender systems. AltFS integrates semantic reasoning from Large Language Models (LLMs) with task-specific learning from agency models. Initially, LLMs will generate a semantic ranking of feature importance, which is then refined by an agency model, combining world knowledge with task-specific insights. Extensive experiments on three public datasets from real-world recommender platforms demonstrate the effectiveness of AltFS. Our code is publicly available for reproducibility.
Information Retrieval
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to solve the feature selection problem in deep recommender systems (DRS). Specifically, traditional methods rely on surrogate models (such as decision trees or neural networks) to estimate the importance of features, but these methods have the following limitations in practical applications: 1. **Poor training conditions**: For example, problems such as feature collinearity, high - dimensional sparsity, and insufficient data may cause surrogate models to be unable to learn effectively. 2. **Neglect of the dependency relationships between features**: Traditional surrogate models are difficult to capture the complex dependency relationships between features, such as feature collinearity and complementarity. To solve these problems, this paper proposes AltFS (Agency - light Feature Selection), a lightweight feature selection method that combines the semantic reasoning of large - language models (LLMs) and task - specific learning. In this way, AltFS can more accurately evaluate the importance of features in different application scenarios, thereby improving the performance of the recommender system. ### Main contributions of AltFS 1. **Combination of world knowledge and task - specific information**: AltFS is the first to combine world - knowledge priors with surrogate - based feature selection, solving the inaccuracy caused by relying solely on surrogate models. 2. **Iterative prompting method**: A context - aware prompt iterative method is designed, enabling LLMs to iteratively select effective features according to their world knowledge. 3. **Bridging network**: A novel bridging network is introduced to integrate feature importance from the world - knowledge and recommender - task spaces in a lightweight and end - to - end manner. 4. **Experimental verification**: Extensive experiments were carried out on three public datasets from real - world online service platforms to verify the effectiveness of this method. ### Formula presentation To ensure the correctness and readability of the formulas, the following are some of the key formulas involved in the paper: - **Feature importance score update**: \[ f_{i_k}^t = 1-\frac{t}{N} \] where \( f_{i_k}^t \) represents the importance score of the feature selected by the \( k \) - th LLM at the \( t \) - th step, and \( N \) is the number of feature fields. - **Calculation of comprehensive weights**: \[ h_n^*=\sum_{k = 0}^{K}\sum_{t = 0}^{N}\delta(F_n, f_k^t)\cdot(f_{i_k}^t\cdot\hat{w}_k^*) \] where \( F_n \) is the \( n \) - th feature field, \( \delta(F_n, f_k^t) \) is an indicator function, and \( \hat{w}_k^* \) is a trained weighted gate. Through these improvements, AltFS can still maintain stable superior performance in the case of scarce data, and the selected features have strong transferability and are suitable for various deep - recommendation models.