Self-learning Control for Wavefront Sensorless Adaptive Optics System Through Deep Reinforcement Learning

Ke Hu,Bing Xu,Zhenxing Xu,Lianghua Wen,Ping Yang,Shuai Wang,Lizhi Dong
DOI: https://doi.org/10.1016/j.ijleo.2018.09.160
IF: 3.1
2018-01-01
Optik
Abstract:An aberration correction algorithm for wavefront sensorless adaptive optics (WFSless AO) systems based on deep reinforcement learning is presented. An actor–critic structure is designed to evaluate a control policy through the deep deterministic policy gradient (DDPG) algorithm. The algorithm performance is verified with a set-up simulation environment. According to the correction results, the aberration correction process can be expressed as a Markov decision process (MDP). The method exhibits excellent performances in correction capacity and speed. Similar correction effects are obtained with the stochastic parallel-gradient descent (SPGD) algorithm and WFSless AO based on the general-modes (AOG) algorithm. Moreover, the correction speed is improved by approximately 9 and 2.5 times, respectively.
What problem does this paper attempt to address?