Learning to Classify Faster Using Spiking Neural Networks

Pranav Machingal, Thousif, Shirin Dora, Suresh Sundaram, Qinggang Meng
DOI: https://doi.org/10.1109/IJCNN54540.2023.10191334
2023-01-01
Abstract: This paper develops a new approach to estimating predicted class probabilities in deep Spiking Neural Networks (SNNs) that encourages faster classification. The proposed approach uses the temporal separation between the first spikes generated by the output neurons to estimate the predicted class probabilities, which are then used with a cross-entropy loss to train the network. Training maximizes the separation between the first spike of the neuron associated with the correct class and the first spikes of the neurons associated with the other classes. Maximizing this temporal separation yields higher classification performance and also drives the correct-class neuron to spike earlier in the simulation. As a consequence, the predicted class can be determined from the first spike in the output layer, leading to quicker classification. A sensitivity factor for each neuron in the network is estimated via error backpropagation during training, and the network weights are updated using Spike Timing Dependent Plasticity (STDP) modulated by these estimated sensitivity factors. The resulting learning method is termed Temporal Separation Modulated Spike Timing Dependent Plasticity (TSM-STDP). The performance of TSM-STDP is assessed on the benchmark MNIST dataset and compared with that of other learning methods for SNNs. Additionally, a histogram of the first spikes in the output layer shows that the correct-class neurons spike earlier in the simulation than the other class neurons, enabling faster classification. The effectiveness of TSM-STDP is also assessed on a real-world Attention Deficit Hyperactivity Disorder (ADHD) detection dataset and compared with other available approaches. The comparison shows that TSM-STDP achieves classification performance comparable to existing learning algorithms on both the benchmark and real-world datasets while requiring less time for classification.
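As a rough illustration of the idea in the abstract, the sketch below turns output-layer first-spike times into class probabilities and scores them with cross-entropy. The abstract does not state the exact formula, so the softmax over negated spike times, the `scale` parameter, and all function names are assumptions made for illustration, not the authors' TSM-STDP implementation.

```python
import math


def first_spike_probabilities(first_spike_times, scale=1.0):
    """Assumed mapping: softmax over negated first-spike times.

    An earlier first spike gives a larger logit, hence a higher probability.
    `scale` is a hypothetical temperature-like parameter.
    """
    logits = [-t / scale for t in first_spike_times]
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, target):
    """Cross-entropy loss for a single sample with integer class label."""
    return -math.log(probs[target])


def predict(first_spike_times):
    """Predicted class is the output neuron that spikes first."""
    return min(range(len(first_spike_times)), key=first_spike_times.__getitem__)


# Hypothetical first-spike times (in simulation steps) for three output neurons.
times = [12.0, 5.0, 9.0]
probs = first_spike_probabilities(times)
loss_correct = cross_entropy(probs, target=1)  # neuron 1 spikes first
loss_wrong = cross_entropy(probs, target=0)    # neuron 0 spikes last

# Widening the gap between the correct neuron and the others lowers the loss.
wider = first_spike_probabilities([12.0, 3.0, 9.0])
loss_wider = cross_entropy(wider, target=1)
```

Under this assumed mapping, minimizing the cross-entropy pushes the correct-class neuron to spike earlier relative to the others, which is why the prediction can be read off the first output spike.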