Abstract:In wireless Internet of Things (IoT) systems, the multi-input multi-output (MIMO) and cognitive radio (CR) techniques are usually involved into the mobile edge computing (MEC) structure to improve the spectrum efficiency and transmission reliability. However, such a CR based MIMO IoT system will suffer from a variety of smart attacks from wireless environments, even the MEC servers in IoT systems are not secure enough and vulnerable to these attacks. In this paper, we investigate a secure communication problem in a cognitive MIMO IoT system comprising of a primary user (PU), a secondary user (SU), a smart attacker and several MEC servers. The target of our system design is to optimize utility of the SU, including its efficiency and security. The SU will choose an idle MEC server that is not occupied by the PU in the CR scenario, and allocates a proper offloading rate of its computation tasks to the server, by unloading such tasks with proper transmit power. In such a CR IoT system, the attacker will select one type of smart attacks. Then two deep reinforcement learning based resource allocation strategies are proposed to find an optimal policy of maximal utility without channel state information(CSI), one of which is the Dyna architecture and Prioritized sweeping based Edge Server Selection (DPESS) strategy, and the other is the Deep Q-network based Edge Server Selection (DESS) strategy. Specifically, the convergence speed of the DESS scheme is significantly improved due to the trained convolutional neural network (CNN) by utilizing the experience replay technique and stochastic gradient descent (SGD). In addition, the Nash equilibrium and existence conditions of the proposed two schemes are theoretically deduced for the modeled MEC game against smart attacks. Compared with the traditional Q-learning algorithm, the average utility and secrecy capacity of the SU can be improved by the proposed DPESS and DESS schemes. Numerical simulations are also presented to verify the better performance of our proposals in terms of efficiency and security, including the higher convergence speed of the DESS strategy.

Learning-Aided Dynamic Access Control in MEC-Enabled Green IoT Networks: A Convolutional Reinforcement Learning Approach

Deep Reinforcement Learning for Dynamic Access Control with Battery Prediction for Mobile-Edge Computing in Green IoT Networks

Reinforcement Learning based Multi-Access Control and Battery Prediction with Energy Harvesting in IoT Systems

Reinforcement Learning-Based Multiaccess Control and Battery Prediction with Energy Harvesting in IoT Systems

Multi-user Resource Control with Deep Reinforcement Learning in IoT Edge Computing

Deep Reinforcement Learning Based Massive Access Management for Ultra-Reliable Low-Latency Communications

Power Control in Energy Harvesting Multiple Access System with Reinforcement Learning.

Reinforcement Learning-based Mobile Edge Computing and Transmission Scheduling for Video Surveillance

Mobile Edge Computing Against Smart Attacks with Deep Reinforcement Learning in Cognitive MIMO IoT Systems

Reinforcement Learning Based Multi-Access Control with Energy Harvesting

Adaptive Request Scheduling and Service Caching for MEC-Assisted IoT Networks: an Online Learning Approach

Deep Reinforcement Learning for Computation and Communication Resource Allocation in Multiaccess MEC Assisted Railway IoT Networks

Resource Allocation Based on Deep Reinforcement Learning in IoT Edge Computing

Security Enhancement for RIS-Aided MEC Systems with Deep Reinforcement Learning

Green-Oriented Offloading and Resource Allocation by Reinforcement Learning in MEC

An Efficient Computation Offloading Approach in Multi-access Edge Computing Using Deep Reinforcement Learning

Deep Reinforcement Learning Approach for Enhancing Profitability in Mobile Edge Computing

Deep Reinforcement Learning Based Computation Offloading and Resource Allocation for MEC

Cache-Aided MEC for IoT: Resource Allocation Using Deep Graph Reinforcement Learning

A Deep Reinforcement Learning Scheme for SCMA-Based Edge Computing in IoT Networks.

Deep Reinforcement Learning-based Dynamic SFC Deployment in IoT-MEC Networks