Abstract:Recommender systems rely heavily on increasing computation resources to improve their business goal. By deploying computation-intensive models and algorithms, these systems are able to inference user interests and exhibit certain ads or commodities from the candidate set to maximize their business goals. However, such systems are facing two challenges in achieving their goals. On the one hand, facing massive online requests, computation-intensive models and algorithms are pushing their computation resources to the limit. On the other hand, the response time of these systems is strictly limited to a short period, e.g. 300 milliseconds in our real system, which is also being exhausted by the increasingly complex models and algorithms. In this paper, we propose the computation resource allocation solution (CRAS) that maximizes the business goal with limited computation resources and response time. We comprehensively illustrate the problem and formulate such a problem as an optimization problem with multiple constraints, which could be broken down into independent sub-problems. To solve the sub-problems, we propose the revenue function to facilitate the theoretical analysis, and obtain the optimal computation resource allocation strategy. To address the applicability issues, we devise the feedback control system to help our strategy constantly adapt to the changing online environment. The effectiveness of our method is verified by extensive experiments based on the real dataset from <a class="link-external link-http" href="http://Taobao.com" rel="external noopener nofollow">this http URL</a>. We also deploy our method in the display advertising system of Alibaba. The online results show that our computation resource allocation solution achieves significant business goal improvement without any increment of computation cost, which demonstrates the efficacy of our method in real industrial practice.

RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems

Computation Resource Allocation Solution in Recommender Systems

RPAF: A Reinforcement Prediction-Allocation Framework for Cache Allocation in Large-Scale Recommender Systems

CPMR: Context-Aware Incremental Sequential Recommendation with Pseudo-Multi-Task Learning

Collaborative Topic Regression for Online Recommender Systems: an Online and Bayesian Approach

Multi-Task Fusion via Reinforcement Learning for Long-Term User Satisfaction in Recommender Systems

Long-term Recommender System Based on ACP Framework

Towards Personalized Federated Multi-Scenario Multi-Task Recommendation

Whole-Chain Recommendations

MBCAL: Sample Efficient and Variance Reduced Reinforcement Learning for Recommender Systems

Personalized Multi-task Training for Recommender System

A Survey on Multi-Behavior Sequential Recommendation

Optimizing Ranking Algorithm in Recommender System Via Deep Reinforcement Learning

Deep Hierarchical Reinforcement Learning Based Recommendations via Multi-goals Abstraction

Deep Pareto Reinforcement Learning for Multi-Objective Recommender Systems

Cache-Aware Reinforcement Learning in Large-Scale Recommender Systems

Model-enhanced Contrastive Reinforcement Learning for Sequential Recommendation

A Deep Reinforcement Learning Recommender System With Multiple Policies for Recommendations

Parallel Knowledge Enhancement Based Framework for Multi-behavior Recommendation

Leveraging Pointwise Prediction with Learning to Rank for Top-N Recommendation

A Model-based Multi-Agent Personalized Short-Video Recommender System