Extreme Trust Region Policy Optimization for Active Object Recognition

Huaping Liu,Yupei Wu,Fuchun Sun
DOI: https://doi.org/10.1109/tnnls.2017.2785233
IF: 14.255
2018-01-01
IEEE Transactions on Neural Networks and Learning Systems
Abstract:In this brief, we develop a deep reinforcement learning method to actively recognize objects by choosing a sequence of actions for an active camera that helps to discriminate between the objects. The method is realized using trust region policy optimization, in which the policy is realized by an extreme learning machine and, therefore, leads to efficient optimization algorithm. The experimental results on the publicly available data set show the advantages of the developed extreme trust region optimization method.
What problem does this paper attempt to address?