Neocortical plasticity: an unsupervised cake but no free lunch
Eilif B. Muller, Philippe Beaudoin
DOI: https://doi.org/10.48550/arXiv.1911.08584
2019-11-16
Abstract: The fields of artificial intelligence and neuroscience have a long history of fertile bi-directional interactions. On the one hand, important inspiration for the development of artificial intelligence systems has come from the study of natural systems of intelligence, the mammalian neocortex in particular. On the other, important inspiration for models and theories of the brain have emerged from artificial intelligence research. A central question at the intersection of these two areas is concerned with the processes by which neocortex learns, and the extent to which they are analogous to the back-propagation training algorithm of deep networks. Matching the data efficiency, transfer and generalization properties of neocortical learning remains an area of active research in the field of deep learning. Recent advances in our understanding of neuronal, synaptic and dendritic physiology of the neocortex suggest new approaches for unsupervised representation learning, perhaps through a new class of objective functions, which could act alongside or in lieu of back-propagation. Such local learning rules have implicit rather than explicit objectives with respect to the training data, facilitating domain adaptation and generalization. Incorporating them into deep networks for representation learning could better leverage unlabelled datasets to offer significant improvements in data efficiency of downstream supervised readout learning, and reduce susceptibility to adversarial perturbations, at the cost of a more restricted domain of applicability.
Neurons and Cognition, Artificial Intelligence, Machine Learning, Neural and Evolutionary Computing
What problem does this paper attempt to address?
This paper addresses the gap between how the neocortex learns and how current deep-learning models are trained, and asks how unsupervised learning methods inspired by the neocortex could improve the data efficiency, generalization, and transfer ability of deep networks. Specifically:
1. **Unsupervised versus supervised learning**: The paper contrasts neocortical learning, which extracts effective representations from large amounts of unlabeled sensory input, with current deep-learning models, which rely mainly on supervised learning from large labeled datasets.
2. **Neural plasticity mechanisms**: The paper explores the physiology of neurons, synapses, and dendrites in the neocortex, in particular local learning rules and mechanisms such as NMDA spikes and calcium-driven synaptic plasticity. These mechanisms may inspire the design of new unsupervised learning algorithms.
3. **Adversarial examples**: The paper points out that current deep-learning models are vulnerable to adversarial examples, in which a tiny perturbation of the input causes the model to produce a wrong output. It suggests that representations learned with unsupervised local rules could reduce this susceptibility.
4. **Generalization and transfer learning**: The paper argues that unsupervised learning methods can improve generalization and transfer, so that a model performs better on new tasks or new data, much as the neocortex handles previously unseen input efficiently.
5. **No-free-lunch theorem**: These advantages come with limitations. Unsupervised learning algorithms in the neocortex usually require specific input structures and time scales, which restricts their domain of applicability. Unsupervised methods therefore cannot completely replace supervised learning, but they can play an important role in certain scenarios.
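To make the notion of a local learning rule in item 2 more concrete, the following is a minimal sketch in Python using Oja's rule. Oja's rule is a classic Hebbian rule chosen here purely for illustration; it is not the mechanism proposed in the paper. Each weight update depends only on the presynaptic input `x`, the postsynaptic activity `y`, and the weight itself, so no error signal is back-propagated and the objective (finding the direction of greatest variance) is implicit in the rule. The function name `oja_train`, the learning rate, and the synthetic data are all assumptions made for this example.

```python
import numpy as np

def oja_train(X, lr=0.005, epochs=5, seed=0):
    """Train a single linear unit with Oja's rule (illustrative, not from the paper)."""
    rng = np.random.default_rng(seed)
    w = rng.normal(size=X.shape[1])
    w /= np.linalg.norm(w)
    for _ in range(epochs):
        for x in X[rng.permutation(len(X))]:
            y = w @ x                    # postsynaptic activity of the unit
            w += lr * y * (x - y * w)    # Hebbian term minus a local decay term
    return w

# Synthetic, roughly zero-mean 2-D data with most variance along (1, 1)/sqrt(2).
rng = np.random.default_rng(1)
latent = rng.normal(size=(2000, 2)) * np.array([2.0, 0.5])
rotation = np.array([[1.0, 1.0], [-1.0, 1.0]]) / np.sqrt(2.0)
X = latent @ rotation

w = oja_train(X)

# Compare with the first principal component computed directly from the covariance.
top_pc = np.linalg.eigh(np.cov(X.T))[1][:, -1]
alignment = abs(w @ top_pc)
print("alignment with first principal component:", alignment)
```

On data like this, the learned weight vector should end up closely aligned with the first principal component and close to unit length, although the exact values depend on the seed and learning rate. No labels are used at any point, which is the sense in which such a rule is unsupervised.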
In summary, the paper explores how the unsupervised learning mechanisms of the neocortex could be used to improve deep-learning models, particularly their data efficiency, generalization, and robustness.
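The robustness concern can be illustrated with a small sketch of an adversarial perturbation. The example below applies the fast gradient sign method to a logistic-regression classifier; this technique and the setup are assumptions chosen for illustration and are not taken from the paper. The weights `w` are random stand-ins for a trained model, and `fgsm_perturb`, `eps`, and the input construction are hypothetical names and values.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fgsm_perturb(x, y, w, b, eps):
    """Fast-gradient-sign perturbation for a logistic-regression classifier."""
    p = sigmoid(w @ x + b)
    grad_x = (p - y) * w               # gradient of the cross-entropy loss w.r.t. the input
    return x + eps * np.sign(grad_x)   # move every coordinate by at most eps

rng = np.random.default_rng(0)
dim = 1000
w = rng.normal(size=dim)               # random stand-in for trained weights (assumption)
b = 0.0

# Build an input that the model confidently assigns to class 1 (logit of +3).
x = rng.normal(size=dim)
x += (3.0 - (w @ x + b)) * w / (w @ w)

eps = 0.01
x_adv = fgsm_perturb(x, y=1.0, w=w, b=b, eps=eps)

clean_logit = w @ x + b
adv_logit = w @ x_adv + b
max_change = np.max(np.abs(x_adv - x))
print("clean logit:", clean_logit)
print("adversarial logit:", adv_logit)
print("largest per-coordinate change:", max_change)
```

For this linear model the perturbation shifts the logit by `eps` times the L1 norm of `w`, so in a high-dimensional input space a change of only `eps` per coordinate is enough to flip the predicted class. Whether representations learned with local unsupervised rules reduce this effect in practice is the open question the paper raises; the sketch only shows the vulnerability itself.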