Advancements in Recommender Systems: A Comprehensive Analysis Based on Data, Algorithms, and Evaluation

Xin Ma,Mingyue Li,Xuguang Liu
2024-07-10
Abstract:Using 286 research papers collected from Web of Science, ScienceDirect, SpringerLink, arXiv, and Google Scholar databases, a systematic review methodology was adopted to review and summarize the current challenges and potential future developments in data, algorithms, and evaluation aspects of RSs. It was found that RSs involve five major research topics, namely algorithmic improvement, domain applications, user behavior & cognition, data processing & modeling, and social impact & ethics. Collaborative filtering and hybrid recommendation techniques are mainstream. The performance of RSs is jointly limited by four types of eight data issues, two types of twelve algorithmic issues, and two evaluation issues. Notably, data-related issues such as cold start, data sparsity, and data poisoning, algorithmic issues like interest drift, device-cloud collaboration, non-causal driven, and multitask conflicts, along with evaluation issues such as offline data leakage and multi-objective balancing, have prominent impacts. Fusing physiological signals for multimodal modeling, defending against data poisoning through user information behavior, evaluating generative recommendations via social experiments, fine-tuning pre-trained large models to schedule device-cloud resource, enhancing causal inference with deep reinforcement learning, training multi-task models based on probability distributions, using cross-temporal dataset partitioning, and evaluating recommendation objectives across the full lifecycle are feasible solutions to address the aforementioned prominent challenges and unlock the power and value of RSs.The collected literature is mainly based on major international databases, and future research will further expand upon it.
Information Retrieval,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that, in the context of new technologies, scenarios and diverse requirements, recommender systems (RSs) face serious challenges in terms of data, algorithms and evaluation. Specifically, through a systematic review of 286 related research literatures, the paper aims to: 1. **Data problems**: - **Cold Start**: New users or new items lack initial data, making it difficult to infer users' preferences. - **Data Sparsity**: User behavior data is uneven, limiting the comprehensive modeling and prediction of all users or items. - **Data Poisoning**: Malicious users or actors deliberately provide false information or manipulate behaviors, resulting in biased or misleading recommendation results. 2. **Algorithm problems**: - **Interest Drift**: Users' interests, behaviors and needs change over time, showing temporal and dynamic characteristics. - **Device - Cloud Collaboration**: Achieve efficient recommendation under limited computing, storage and bandwidth resources. - **Non - Causal Drivers**: Generate recommendations based on non - causal relationships (such as association rules and user behavior patterns). - **Multitask Conflicts**: Balance possible conflicts between different interests of users. 3. **Evaluation problems**: - **Offline Data Leakage**: In offline evaluation, data leakage may cause distortion of evaluation results. - **Multi - Objective Balancing**: Find a balance point among multiple evaluation indicators to ensure the comprehensive performance of the recommender system. Through the systematic review method, the paper analyzes these challenges in detail and proposes potential solutions, hoping to provide scientific guidance for the research and development of recommender systems.