Unsupervised representation learning with Hebbian synaptic and structural plasticity in brain-like feedforward neural networks

Naresh Ravichandran, Anders Lansner, Pawel Herman
2024-06-07
Abstract: Neural networks that can capture key principles underlying brain computation offer exciting new opportunities for developing artificial intelligence and brain-like computing algorithms. Such networks remain biologically plausible while leveraging localized forms of synaptic learning rules and modular network architecture found in the neocortex. Compared to backprop-driven deep learning approaches, they provide more suitable models for deploying on neuromorphic hardware and have greater potential for scalability on large-scale computing clusters. The development of such brain-like neural networks depends on having a learning procedure that can build effective internal representations from data. In this work, we introduce and evaluate a brain-like neural network model capable of unsupervised representation learning. It builds on the Bayesian Confidence Propagation Neural Network (BCPNN), which has earlier been implemented as abstract as well as biophysically detailed recurrent attractor neural networks explaining various cortical associative memory phenomena. Here we developed a feedforward BCPNN model to perform representation learning by incorporating a range of brain-like attributes derived from neocortical circuits such as cortical columns, divisive normalization, Hebbian synaptic plasticity, structural plasticity, sparse activity, and sparse patchy connectivity. The model was tested on a diverse set of popular machine learning benchmarks: grayscale images (MNIST, Fashion-MNIST), RGB natural images (SVHN, CIFAR-10), QSAR (MUV, HIV), and malware detection (EMBER). The performance of the model when using a linear classifier to predict the class labels fared competitively with conventional multi-layer perceptrons and other state-of-the-art brain-like neural networks.
Neural and Evolutionary Computing, Neurons and Cognition
What problem does this paper attempt to address?
The core problem this paper addresses is how to build a brain-like feedforward neural network capable of unsupervised representation learning. Specifically, the authors introduce and evaluate a model based on the Bayesian Confidence Propagation Neural Network (BCPNN) that constructs effective internal representations from data without relying on external labels. By combining brain-like properties derived from neocortical circuits (cortical columns, divisive normalization, Hebbian synaptic plasticity, structural plasticity, sparse activity, and sparse patchy connectivity), the model aims to perform competitively with conventional multi-layer perceptrons and other state-of-the-art brain-like neural networks across a range of machine learning benchmarks.

### Main problem summary:

1. **Unsupervised representation learning**: How to design a neural network that automatically extracts features from data without external labels.
2. **Integration of brain-like properties**: How to incorporate multiple biological mechanisms of the neocortex (such as Hebbian synaptic plasticity and structural plasticity) into artificial neural networks to improve their performance and biological plausibility.
3. **Practical applicability**: Whether the proposed model performs well across multiple machine learning tasks (image classification, QSAR prediction, and malware detection) and remains competitive with conventional multi-layer perceptrons.

### Key improvement points:

- **Structural plasticity algorithm**: Forms sparse patchy connectivity in a data-driven manner.
- **Continuous-valued data interface**: Uses population vector coding to convert continuous-valued data features into a form that the BCPNN architecture can handle.
- **Theoretical equivalence**: The propagation and learning steps of the feedforward BCPNN model are shown to be equivalent to the E-step and M-step of the expectation-maximization (EM) algorithm for a discrete mixture model.

Through these improvements, the authors aim to show that the model not only conforms to the principles of brain-like computing in theory but also performs unsupervised representation learning effectively in practice.
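
To make the continuous-valued data interface and the E-step/M-step reading of BCPNN more concrete, here is a minimal Python sketch. It is not the authors' implementation: the function names (`population_encode`, `propagate`, `learn`), the Gaussian tuning curves, and the parameters `sigma`, `alpha`, and `eps` are assumptions chosen for illustration. It models a single hidden hypercolumn and omits structural plasticity and sparse connectivity entirely.

```python
import numpy as np


def population_encode(x, n_units=10, sigma=0.1):
    """Encode values in [0, 1] as normalized population activities.

    Assumption: Gaussian tuning curves with evenly spaced centers.
    Returns shape (n_features, n_units); each row sums to 1, so every
    feature behaves like one input hypercolumn.
    """
    x = np.asarray(x, dtype=float)
    centers = np.linspace(0.0, 1.0, n_units)  # preferred value of each unit
    act = np.exp(-0.5 * ((x[:, None] - centers[None, :]) / sigma) ** 2)
    return act / act.sum(axis=1, keepdims=True)


def propagate(x, p_i, p_j, p_ij, eps=1e-8):
    """E-step-like pass: posterior over the units of one hidden hypercolumn."""
    bias = np.log(p_j + eps)                                    # log prior
    weights = np.log((p_ij + eps) / (np.outer(p_i, p_j) + eps))
    support = bias + x @ weights
    support -= support.max()                # shift for numerical stability
    y = np.exp(support)
    return y / y.sum()                      # softmax (divisive normalization)


def learn(x, y, p_i, p_j, p_ij, alpha=0.05):
    """M-step-like pass: running averages of activation probabilities."""
    p_i = (1 - alpha) * p_i + alpha * x
    p_j = (1 - alpha) * p_j + alpha * y
    p_ij = (1 - alpha) * p_ij + alpha * np.outer(x, y)  # local, Hebbian term
    return p_i, p_j, p_ij


# Toy run on random data (illustrative only, not a benchmark).
rng = np.random.default_rng(0)
n_features, n_units, n_hidden = 4, 10, 5
n_in = n_features * n_units
p_i = np.full(n_in, 1.0 / n_units)
p_j = np.full(n_hidden, 1.0 / n_hidden)
# Small random perturbation breaks the symmetry between hidden units.
p_ij = np.outer(p_i, p_j) * rng.uniform(0.9, 1.1, size=(n_in, n_hidden))

for _ in range(100):
    x = population_encode(rng.random(n_features), n_units).ravel()
    y = propagate(x, p_i, p_j, p_ij)
    p_i, p_j, p_ij = learn(x, y, p_i, p_j, p_ij)
```

In this sketch the weight `log(p_ij / (p_i * p_j))` differs from the mixture-model term `log(p_ij / p_j)` only by `log p_i`, which does not depend on the hidden unit and therefore cancels in the softmax. That cancellation is one way to see why the forward pass can be read as computing mixture responsibilities, while the running-average updates play the role of re-estimating the mixture parameters.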