Autonomous Boundary of Human-Machine Collaboration System Based on Reinforcement Learning

Qianqian Zhang,Yun-Bo Zhao,Yu Kang
DOI: https://doi.org/10.1109/anzcc50923.2020.9318326
2020-01-01
Abstract:This paper provides a human-machine collaborative control framework, including artificial intelligence decision systems, human-level control, arbiter judgment, and learning of autonomous boundary, so that human suggestions are incorporated into the training process of decisions, assisting agents to learn quickly control decision tasks. Based on the model-free deep reinforcement learning algorithm HITL-AC, the human feedback (reward or punishment) is connected with the reward of the agent, so that the agent continuously tries to find a better boundary during the system's operation, avoiding defects of pre-fixed boundary. This formulation improves the data efficiency of reinforcement learning and plays a guiding role in seeking human intervention when the agent is in an uncertain environmental state during the test use phase. The fourth section of the paper gives a training demonstration of a realtime environment (bipedal walker). Compared with existing standard reinforcement learning methods that do not consider boundary concepts, the method with boundary information mentioned in this article can accelerate the process of agent reinforcement learning during the training phase, and seek human help when guiding the dangerous state of the agent during the test phase. And the real-time optimization algorithm (HITL-AC) for the boundary is better than the fixed value algorithm (HITL-FIX). This is beneficial for solving real-world problems, further proving the feasibility and effectiveness of the proposed framework and method.
What problem does this paper attempt to address?