Abstract:Many real-world applications involve data from multiple modalities and thus exhibit the viewheterogeneity. For example, user modeling on social media might leverage both the topology of the underlying social network and the content of the users' posts; in the medical domain, multiple views could be X-ray images taken at different poses. To date, various techniques have been proposed to achieve promising results, such as canonical correlation analysis based methods, etc. In the meanwhile, it is critical for decision-makers to be able to understand the prediction results from these methods. For example, given the diagnostic result that a model provided based on the X-ray images of a patient at different poses, the doctor needs to know why the model made such a prediction. However, state-of-the-art techniques usually suffer from the inability to utilize the complementary information of each view and to explain the predictions in an interpretable manner. To address these issues, in this paper, we propose a deep coattention network for multi-view subspace learning, which aims to extract both the common information and the complementary information in an adversarial setting and provide robust interpretations behind the prediction to the end-users via the co-attention mechanism. In particular, it uses a novel cross reconstruction loss and leverages the label information to guide the construction of the latent representation by incorporating the classifier into our model. This improves the quality of latent representation and accelerates the convergence speed. Finally, we develop an efficient iterative algorithm to find the optimal encoders and discriminator, which are evaluated extensively on synthetic and real-world data sets. We also conduct a case study to demonstrate how the proposed method robustly interprets the predictions on an image data set.

End-to-end Learning of LDA by Mirror-Descent Back Propagation over a Deep Architecture

Joint Domain Alignment and Discriminative Feature Learning for Unsupervised Deep Domain Adaptation

Interpreting the Prediction Process of A Deep Network Constructed from Supervised Topic Models

Max-Margin Deep Diverse Latent Dirichlet Allocation With Continual Learning

Learning Topic Models by Belief Propagation

Deep PLS: A Lightweight Deep Learning Model for Interpretable and Efficient Data Analytics

Sparse Q-learning with Mirror Descent

Deep Learning for Content-Based Image Retrieval: A Comprehensive Study

Boosting Data-Driven Mirror Descent with Randomization, Equivariance, and Acceleration

Deep Co-Attention Network for Multi-View Subspace Learning

Deep LDA Hashing.

MedLDA: maximum margin supervised topic models

A New Approach to Speeding Up Topic Modeling

Deep Generative LDA.

DLBD: A Self-Supervised Direct-Learned Binary Descriptor

Deep Latent Dirichlet Allocation with Topic-Layer-Adaptive Stochastic Gradient Riemannian MCMC

Learning by Teaching, with Application to Neural Architecture Search

Deep Boosting: Joint Feature Selection and Analysis Dictionary Learning in Hierarchy

Deep learning model construction for a semi-supervised classification with feature learning

Associated Learning: Decomposing End-to-end Backpropagation based on Auto-encoders and Target Propagation

Deep shared learning and attentive domain mapping for cross-domain recommendation