Unsupervisedly Learned Latent Graphs as Transferable Representations

Zhilin Yang,Jake Zhao,Bhuwan Dhingra,Kaiming He,William W. Cohen,Ruslan Salakhutdinov,Yann LeCun
2018-01-01
Abstract:Modern deep transfer learning approaches have mainly focused on learning emph{generic} feature vectors from one task that are transferable to other tasks, such as word embeddings in language and pretrained convolutional features in vision. However, these approaches usually transfer unary features and largely ignore more structured graphical representations. This work explores the possibility of learning generic latent graphs that capture dependencies between pairs of data units (e.g., words or pixels) from large-scale unlabeled data and transferring the graphs to downstream tasks. Our proposed transfer learning framework improves performance on various tasks including question answering, natural language inference, sentiment analysis, and image classification. We also show that the learned graphs are generic enough to be transferred to different types of input embeddings or features.
What problem does this paper attempt to address?