Abstract:Due to the highly dynamic changes in wireless network topologies, efficiently obtaining network status information and flexibly forwarding data to improve communication quality of service are important challenges. This article introduces an intelligent routing algorithm (DRL-PPONSA) based on proximal policy optimization deep reinforcement learning with network situational awareness under a software-defined wireless networking architecture. First, a specific data plane is designed for network topology construction and data forwarding. The control plane collects network traffic information, sends flow tables, and uses a GCN-GRU prediction mechanism to perceive future traffic change trends to achieve network situational awareness. Second, a DRL-based data forwarding mechanism is designed in the knowledge plane. The predicted network traffic matrix and topology information matrix are treated as the environment for DRL agents, while next-hop adjacent nodes are treated as executable actions. Accordingly, action selection strategies are designed for different network conditions to achieve more intelligent, flexible, and efficient routing control. The reward function is designed using network link information and various reward and penalty mechanisms. Additionally, importance sampling and gradient clipping techniques are employed during gradient updating to enhance convergence speed and stability. Experimental results show that DRL-PPONSA outperforms traditional routing methods in network throughput, delay, packet loss rate, and wireless node distance. Compared to value-function-based Dueling DQN routing, the convergence speed is significantly improved, and the convergence effect is more stable. Simultaneously, its consumption of hardware storage space is reduced, and efficient routing decisions can be made in real-time using the current network state information.

Compact Learning Model for Dynamic Off-Chain Routing in Blockchain-Based IoT

A Multipath Routing for Payment Channel Networks for Internet of Things Microtransactions

A deep reinforcement learning-based multi-optimality routing scheme for dynamic IoT networks

Robust and energy-efficient RPL optimization algorithm with scalable deep reinforcement learning for IIoT

Secure Deep Reinforcement Learning for Dynamic Resource Allocation in Wireless MEC Networks

Reinforcement learning-based load balancing for heavy traffic Internet of Things

Deep Reinforcement Learning for Online Resource Allocation in IoT Networks: Technology, Development, and Future Challenges

Deep reinforcement learning approach for computation offloading in blockchain-enabled communications systems

GTD3-NET: A deep reinforcement learning-based routing optimization algorithm for wireless networks

Deep Reinforcement Learning Based Massive Access Management for Ultra-Reliable Low-Latency Communications

Q-Learning Improved Lightweight Consensus Algorithm for Blockchain-Structured Internet of Things

Deep Reinforcement Learning Based Performance Optimization in Blockchain-Enabled Internet of Vehicle

Blockchain-Aided Network Resource Orchestration in Intelligent Internet of Things

Performance Comparison of Different Deep Reinforcement Learning Algorithms for Task Scheduling Problem in Blockchain-Enabled Internet of Vehicles

Energy-efficient deep Q-network: reinforcement learning for efficient routing protocol in wireless internet of things

Real-Time Recursive Routing in Payment Channel Network: A Bidding-based Design

Scalable Deep Reinforcement Learning-Based Online Routing for Multi-Type Service Requirements

An Intelligent SDWN Routing Algorithm Based on Network Situational Awareness and Deep Reinforcement Learning

DRL-D: Revenue-Aware Online Service Function Chain Deployment Via Deep Reinforcement Learning

Performance Optimization for Blockchain-Enabled Industrial Internet of Things (IIoT) Systems: A Deep Reinforcement Learning Approach