Brain-inspired Action Generation with Spiking Transformer Diffusion Policy Model

Qianhao Wang,Yinqian Sun,Enmeng Lu,Qian Zhang,Yi Zeng
2024-11-15
Abstract:Spiking Neural Networks (SNNs) has the ability to extract spatio-temporal features due to their spiking sequence. While previous research has primarily foucus on the classification of image and reinforcement learning. In our paper, we put forward novel diffusion policy model based on Spiking Transformer Neural Networks and Denoising Diffusion Probabilistic Model (DDPM): Spiking Transformer Modulate Diffusion Policy Model (STMDP), a new brain-inspired model for generating robot action trajectories. In order to improve the performance of this model, we develop a novel decoder module: Spiking Modulate De coder (SMD), which replaces the traditional Decoder module within the Transformer architecture. Additionally, we explored the substitution of DDPM with Denoising Diffusion Implicit Models (DDIM) in our frame work. We conducted experiments across four robotic manipulation tasks and performed ablation studies on the modulate block. Our model consistently outperforms existing Transformer-based diffusion policy method. Especially in Can task, we achieved an improvement of 8%. The proposed STMDP method integrates SNNs, dffusion model and Transformer architecture, which offers new perspectives and promising directions for exploration in brain-inspired robotics.
Robotics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the poor performance of existing Transformer - based spiking neural network models in generating robot motion trajectories. Specifically, the paper proposes a new spiking - Transformer - based diffusion strategy model - Spiking Transformer Modulate Diffusion Policy Model (STMDP), aiming to improve the accuracy of motion trajectory generation in robotic manipulation tasks by integrating spiking neural networks (SNNs), diffusion models and Transformer architectures. To achieve this goal, the author develops a new decoding module - Spiking Modulate Decoder (SMD) to replace the traditional Transformer decoding module, and explores the possibility of using Denoising Diffusion Implicit Model (DDIM) to replace Diffusion Probability Model (DDPM). Verified by experiments on four robotic manipulation tasks, this model significantly improves performance in multiple tasks, especially in the "Can" task, with an 8% performance improvement. This indicates that the STMDP method provides a new perspective and a promising research direction for brain - inspired robotics.