Performance Comparison of Different Deep Reinforcement Learning Algorithms for Task Scheduling Problem in Blockchain-Enabled Internet of Vehicles

Yang Gao,Pengbo Si,Kaiqi Jin,Teng Sun,Wenjun Wu
DOI: https://doi.org/10.1109/tvt.2023.3248651
IF: 6.8
2023-01-01
IEEE Transactions on Vehicular Technology
Abstract:Recently, blockchain has been widely considered as a promising technology to cope with the security issues in Internet of Vehicles (IoV). However, due to the high energy consumption, large data storage and heavy transmission load of blockchain and the limited resources of IoV devices, the resource management is urgently to be studied. In this paper, we propose a blockchain-based trust trading platform in IoV scenario and formulate the Task Scheduling (TS) problem which selects the transactions to assemble block and effects the utilization of the wireless resources and the performance of blockchain system. The optimization object is designed by jointly considering the characteristics of wireless communications, the Quality of Service (QoS) and the implementation process of blockchain. The DRL algorithms are utilized as the solutions, and MCQ-TS, PG-TS, TDQ-TS and TDAC-TS algorithms are proposed base on several typical DRL methods. The computational complexity of the proposed algorithms is analyzed mathematically. Additionally, a fair and comprehensive comparison of the various proposed DRL methods is also conducted through the complexity analysis and the simulation results. Accordingly, the features and approriate applied scenarios of the proposed algorithms are summarized at last. PG-TS has the best optimization performance while MCQ-TS, TDQ-TS and TDAC-TS perform similar. MCQ-TS has the smallest complexity, and TDQ-TS and TDAC-TS have the best training efficiency.
telecommunications,engineering, electrical & electronic,transportation science & technology
What problem does this paper attempt to address?