Abstract: The Internet era is an era of information explosion. By 2022, the number of global Internet users had exceeded 4 billion, and the number of social media users had exceeded 3 billion. People encounter a large volume of news content every day, and it is almost impossible to find the information that interests them by browsing all of it. Against this background, personalized news recommendation technology has been widely adopted, but it still needs further optimization. To better push news content of interest to different readers and to further improve users' satisfaction with major news websites, this study proposes a new recommendation algorithm based on deep learning and reinforcement learning (RL). First, an RL algorithm is introduced on top of deep learning. Deep learning excels at processing large-scale data and recognizing complex patterns, but it often suffers from low sample efficiency in complex decision-making and sequential tasks. RL, by contrast, emphasizes learning an optimal policy through continuous trial and error while interacting with the environment. Compared with deep learning, RL is better suited to scenarios that require long-term decision-making and trial-and-error learning. By feeding back a reward signal for each action, the system can better adapt to unknown environments and complex tasks, which compensates for the relative shortcomings of deep learning in these respects. Scenarios are paired with actions to address the sequential decision problem in the news dissemination process. To let the news recommendation system account for dynamic changes in users' interest in news content, the Deep Deterministic Policy Gradient (DDPG) algorithm is applied to the news recommendation scenario. The Deep Q-Network (DQN) and the policy network are combined through opposing learning so that they complement each other.
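As an illustration of how DDPG couples a Q-value critic with a deterministic policy network, the following is a minimal sketch rather than the paper's implementation. It assumes linear function approximators, a scalar continuous action (for example a ranking weight), and hypothetical values for the discount factor and soft-update rate; none of these details are given in the abstract.

```python
# Minimal DDPG-style update with linear actor and critic (illustrative only).
# All names, shapes and hyperparameters here are assumptions, not the paper's.

GAMMA = 0.9  # discount factor (assumed)
TAU = 0.1    # soft-update rate for the target networks (assumed)


def dot(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))


def actor(theta, state):
    """Deterministic policy mu(s): user-state features -> scalar action."""
    return dot(theta, state)


def critic(w, state, action):
    """Q(s, a), linear in the state features and the action."""
    return dot(w[:-1], state) + w[-1] * action


def td_target(reward, next_state, done, target_actor, target_critic):
    """y = r + gamma * Q'(s', mu'(s')), computed with the target networks."""
    if done:
        return reward
    next_action = actor(target_actor, next_state)
    return reward + GAMMA * critic(target_critic, next_state, next_action)


def soft_update(target, online, tau=TAU):
    """theta' <- tau * theta + (1 - tau) * theta'."""
    return [tau * o + (1.0 - tau) * t for o, t in zip(online, target)]


def ddpg_step(theta, w, theta_t, w_t, s, a, r, s2, done, lr=0.01):
    """One update from a single transition (s, a, r, s2, done)."""
    y = td_target(r, s2, done, theta_t, w_t)
    td_error = critic(w, s, a) - y
    # Critic: gradient step on 0.5 * td_error**2 with respect to w.
    features = list(s) + [a]
    w = [wi - lr * td_error * fi for wi, fi in zip(w, features)]
    # Actor: deterministic policy gradient, dQ/da * dmu/dtheta = w[-1] * s.
    theta = [ti + lr * w[-1] * si for ti, si in zip(theta, s)]
    # Target networks slowly track the online networks.
    return theta, w, soft_update(theta_t, theta), soft_update(w_t, w)
```

In a recommender, `s` would encode the user's recent reading behaviour and `r` would be click feedback; a practical system would replace the linear models with neural networks and sample transitions from a replay buffer.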
On the basis of a thorough summary and analysis, this paper puts forward a mode of intelligent news dissemination and push, and proposes a push process for news communication information based on edge computing technology. Finally, building on the Area Under the Curve (AUC) metric, a Q-Learning Area Under Curve metric for RL models is proposed. This indicator measures the strengths and weaknesses of RL models efficiently and makes it easier to compare models and evaluate offline experiments. The results show that the DDPG algorithm improves the click-through rate by 2.586% compared with the conventional recommendation algorithm, which indicates that the algorithm designed in this paper has a clear advantage in accurate recommendation for users. By optimizing the push mode of intelligent news dissemination, this paper improves the efficiency of news dissemination. In addition, the paper examines the innovative application of intelligent edge technology in news communication, which brings new ideas and practices to the development of news communication methods. Optimizing the push mode of intelligent news dissemination not only improves the user experience but also supports the application of intelligent edge technology in this field, and it has important practical application prospects.
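The abstract does not define the Q-Learning Area Under Curve metric, so the sketch below shows only one plausible reading: treat the critic's Q-value for each logged (user, news) pair as a ranking score and compute the ordinary AUC against the observed click labels. The function names and the input format are assumptions made for illustration.

```python
def auc(scores, labels):
    """Pairwise AUC: probability that a clicked item outranks a non-clicked one.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def q_auc(logged_pairs):
    """AUC over logged (q_value, clicked) pairs from an offline dataset.

    `logged_pairs` is a hypothetical format: q_value is the model's Q(s, a)
    for a recommended item, clicked is 1 if the user clicked it, else 0.
    """
    q_values = [q for q, _ in logged_pairs]
    clicks = [c for _, c in logged_pairs]
    return auc(q_values, clicks)


# Example: two RL models scored on the same logged impressions.
log_model_a = [(0.9, 1), (0.8, 0), (0.3, 1), (0.1, 0)]
log_model_b = [(0.9, 1), (0.2, 0), (0.7, 1), (0.1, 0)]
print(q_auc(log_model_a), q_auc(log_model_b))
```

Under this reading, a model whose Q-values rank clicked items above non-clicked ones more consistently scores closer to 1, which gives a single number for comparing models offline.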

DEN-DQL: Quick Convergent Deep Q-Learning with Double Exploration Networks for News Recommendation

DRN: A Deep Reinforcement Learning Framework for News Recommendation

DAN: Deep Attention Neural Network for News Recommendation

Denoising-Guided Deep Reinforcement Learning for Social Recommendation

Q-ADER: An Effective Q-Learning for Recommendation With Diminishing Action Space

Exploiting Structural and Temporal Influence for Dynamic Social-Aware Recommendation

Doubly Constrained Offline Reinforcement Learning for Learning Path Recommendation

Denoising Neural Network for News Recommendation with Positive and Negative Implicit Feedback

Personalized News Recommendation Method with Double-Layer Residual Connections and Double Multi-Head Self-Attention Mechanisms

DKN: Deep Knowledge-Aware Network for News Recommendation

Optimization of news dissemination push mode by intelligent edge computing technology for deep learning

Pseudo Dyna-Q

A Novel Multi-Step Q-learning Method to Improve Data Efficiency for Deep Reinforcement Learning

Personalized news recommendation based on deep learning

Personalized News Recommendation Algorithm with Enhanced List Information and User Interests

Deep Reinforcement Learning for Personalized Search Story Recommendation

Rethinking Offline Reinforcement Learning for Sequential Recommendation from A Pair-Wise Q-Learning Perspective

Stabilizing Reinforcement Learning in Dynamic Environment with Application to Online Recommendation

IIDQN: an Incentive Improved DQN Algorithm in EBSN Recommender System

Deep Dynamic Neural Network to trade-off between Accuracy and Diversity in a News Recommender System

DOR: A Novel Dual-Observation-Based Approach for News Recommendation Systems