Learning Paired-associate Images with An Unsupervised Deep Learning Architecture

Ti Wang,Daniel L. Silver
DOI: https://doi.org/10.48550/arXiv.1312.6171
2014-01-11
Abstract:This paper presents an unsupervised multi-modal learning system that learns associative representation from two input modalities, or channels, such that input on one channel will correctly generate the associated response at the other and vice versa. In this way, the system develops a kind of supervised classification model meant to simulate aspects of human associative memory. The system uses a deep learning architecture (DLA) composed of two input/output channels formed from stacked Restricted Boltzmann Machines (RBM) and an associative memory network that combines the two channels. The DLA is trained on pairs of MNIST handwritten digit images to develop hierarchical features and associative representations that are able to reconstruct one image given its paired-associate. Experiments show that the multi-modal learning system generates models that are as accurate as back-propagation networks but with the advantage of a bi-directional network and unsupervised learning from either paired or non-paired training examples.
Neural and Evolutionary Computing,Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?