Noisy Spiking Actor Network for Exploration

Ding Chen, Peixi Peng, Tiejun Huang, Yonghong Tian
2024-03-07
Abstract: As a general method for exploration in deep reinforcement learning (RL), NoisyNet can produce problem-specific exploration strategies. Spiking neural networks (SNNs), due to their binary firing mechanism, have strong robustness to noise, making it difficult to realize efficient exploration with local disturbances. To solve this exploration problem, we propose a noisy spiking actor network (NoisySAN) that introduces time-correlated noise during charging and transmission. Moreover, a noise reduction method is proposed to find a stable policy for the agent. Extensive experimental results demonstrate that our method outperforms state-of-the-art methods on a wide range of continuous control tasks from OpenAI Gym.
Machine Learning, Neural and Evolutionary Computing
### What problem does this paper attempt to solve?

This paper aims to address the exploration-exploitation trade-off in Reinforcement Learning (RL), particularly focusing on achieving efficient exploration in continuous control tasks based on Spiking Neural Networks (SNNs). Specifically:

1. **Exploration Method**: Proposes a novel Noisy Spiking Actor Network (NoisySAN) that enhances exploration strategies by introducing time-domain correlated noise during the charging and transmission processes.
2. **Noise Robustness**: Since SNNs have a binary firing mechanism and are highly robust to external noise, traditional local perturbations are ineffective for exploration. The proposed method addresses this issue.
3. **Noise Reduction Method**: To ensure that the agent can find a stable policy after sufficient exploration, the paper also proposes a noise reduction method for non-spiking neurons.
4. **Noise Generation Method**: Introduces colored noise to enhance exploration effects and demonstrates that pink noise performs better than other types of noise.

### Main Contributions

1. Proposes NoisySAN, a novel noisy spiking actor network for exploration in spike-based RL algorithms.
2. Proposes a new noise reduction method for non-spiking neurons in the output layer to stabilize the agent's policy and further improve performance.
3. For the first time, proposes a noise generation method that combines action sequences with spike sequences within SNNs and demonstrates that pink noise outperforms other types of noise.

These improvements enable NoisySAN to achieve superior performance over existing methods in a series of continuous control tasks in OpenAI Gym.
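
To make the idea of time-correlated (colored) noise in the charging step more concrete, here is a minimal sketch in Python with NumPy. It is not the paper's implementation. The function names `colored_noise` and `noisy_lif_rollout`, the leaky integrate-and-fire update, and the parameters `sigma`, `decay`, and `v_th` are all assumptions made for illustration. The noise is produced by standard spectral shaping (scaling random Fourier coefficients so that power falls off as 1/f^beta, with beta = 1 giving pink noise), and the sketch covers only noise injected during charging, not during transmission.

```python
import numpy as np


def colored_noise(beta, n_steps, rng):
    """Sample n_steps values whose power spectrum falls off as 1/f**beta.

    beta = 0 gives white noise, beta = 1 pink noise, beta = 2 red noise.
    Hypothetical helper for illustration, not taken from the paper.
    """
    freqs = np.fft.rfftfreq(n_steps)            # 0, 1/n, ..., up to 0.5
    scale = np.zeros_like(freqs)
    scale[1:] = freqs[1:] ** (-beta / 2.0)      # amplitude ~ f^(-beta/2); DC term dropped
    coeffs = rng.standard_normal(freqs.size) + 1j * rng.standard_normal(freqs.size)
    x = np.fft.irfft(scale * coeffs, n=n_steps)
    return x / x.std()                          # normalise to unit variance


def noisy_lif_rollout(currents, charge_noise, sigma=0.3, decay=0.5, v_th=1.0):
    """Run one leaky integrate-and-fire neuron, adding noise while it charges.

    Assumed toy dynamics: v <- decay * v + input + sigma * noise, with a
    hard reset to 0 after each spike.
    """
    v = 0.0
    spikes = np.zeros(len(currents))
    for t, i_t in enumerate(currents):
        v = decay * v + i_t + sigma * charge_noise[t]   # charging with injected noise
        if v >= v_th:                                   # binary firing
            spikes[t] = 1.0
            v = 0.0                                     # hard reset
    return spikes


rng = np.random.default_rng(0)
pink = colored_noise(beta=1.0, n_steps=64, rng=rng)
spikes = noisy_lif_rollout(np.full(64, 0.3), pink)
```

Because consecutive pink-noise samples are correlated, a perturbation to the membrane potential tends to persist over several time steps instead of averaging out. This is one intuition for why correlated noise can shift the spike train of a neuron that is otherwise robust to independent per-step disturbances.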