Learnable Graph Guided Deep Multi-view Representation Learning Via Information Bottleneck

Liang Zhao, Xiao Wang, Zhenjiao Liu, Ziyue Wang, Zhikui Chen
DOI: https://doi.org/10.1109/tcsvt.2024.3509892
IF: 5.859
2024-01-01
IEEE Transactions on Circuits and Systems for Video Technology
Abstract: In real-world applications, multi-view data has attracted intensive attention because of the complex and complementary relationships across views. Multi-view representation learning (MvRL) aims to obtain a consistent feature representation from multi-view data and has become a popular topic in multi-view research. However, the relationships between samples, i.e., the graph information, are usually ignored or insufficiently exploited by most existing MvRL methods, which treat graph structure only as a regularization term rather than as a graph embedding of the multi-view data. In addition, the limited learning capacity of the shallow models they adopt is another challenge for MvRL. To tackle both issues, we propose a novel unsupervised deep multi-view representation learning model guided by a learnable graph structure, termed LGG-DMRL. It first captures a multi-view consistent graph from the original data through self-representation learning, then uses the learnt graph in a purpose-designed graph-guided attention network to explore the view-specific feature representation of each view. After that, the information bottleneck principle is employed to identify the shared representation across views, integrated with the view-specific feature representations, which promotes multi-view complementarity and completeness. Experimental results on five real-world datasets demonstrate the superiority and effectiveness of the proposed LGG-DMRL compared with recent state-of-the-art multi-view approaches.
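The first step above, learning one graph shared by all views through self-representation, can be illustrated with a minimal sketch. The abstract does not state the objective LGG-DMRL uses, so this assumes a generic ridge-regularized least-squares self-representation, minimizing sum_v ||X_v - C X_v||_F^2 + lam * ||C||_F^2 over a single coefficient matrix C, which has a closed-form solution. The function name `consensus_self_representation`, the parameter `lam`, and the post-hoc symmetrization are illustrative assumptions, not the authors' formulation.

```python
import numpy as np


def consensus_self_representation(views, lam=1.0):
    """Hypothetical helper: one self-representation matrix shared by all views.

    views: list of arrays, each of shape (n_samples, d_v); rows are samples.
    Returns (C, W): the coefficient matrix and a symmetric affinity graph.
    """
    n = views[0].shape[0]
    # Sum the per-view Gram matrices so every view contributes to one graph.
    K = np.zeros((n, n))
    for X in views:
        K += X @ X.T
    # Setting the gradient of the assumed objective to zero gives
    # C (K + lam*I) = K. K is symmetric, so C = (K + lam*I)^{-1} K.
    C = np.linalg.solve(K + lam * np.eye(n), K)
    # Drop self-loops after solving (a common heuristic, assumed here).
    np.fill_diagonal(C, 0.0)
    # Symmetrize the magnitudes to obtain a non-negative affinity graph.
    W = 0.5 * (np.abs(C) + np.abs(C).T)
    return C, W


# Toy usage: two synthetic views of the same 6 samples.
rng = np.random.default_rng(0)
views = [rng.normal(size=(6, 4)), rng.normal(size=(6, 3))]
C, W = consensus_self_representation(views, lam=0.5)
```

In this sketch `W` plays the role of the multi-view consistent graph; in the paper's pipeline such a graph would then guide the attention network, a step that is not reproduced here.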