Cross-model Convolutional Neural Network for Multiple Modality Data Representation.

Yanbin Wu,Li Wang,Fan Cui,Hongbin Zhai,Baoming Dong,Jing-Yan Wang
DOI: https://doi.org/10.1007/s00521-016-2824-4
2017-01-01
Neural Computing and Applications
Abstract:A novel data representation method of convolutional neural network (CNN) is proposed in this paper to represent data of different modalities. We learn a CNN model for the data of each modality to map the data of different modalities to a common space and regularize the new representations in the common space by a cross-model relevance matrix. We further impose that the class label of data points can also be predicted from the CNN representations in the common space. The learning problem is modeled as a minimization problem, which is solved by an augmented Lagrange method with updating rules of Alternating direction method of multipliers. The experiments over benchmark of sequence data of multiple modalities show its advantage.
What problem does this paper attempt to address?