Neural Eigenfunctions Are Structured Representation Learners

Zhijie Deng,Jiaxin Shi,Hao Zhang,Peng Cui,Cewu Lu,Jun Zhu
DOI: https://doi.org/10.48550/arXiv.2210.12637
2022-01-01
Abstract: In this paper, we introduce a scalable method for learning structured, adaptive-length deep representations. Our approach is to train neural networks such that they approximate the principal eigenfunctions of a kernel. We show that, when the kernel is derived from positive relations in a contrastive learning setup, our method outperforms a number of competitive baselines in visual representation learning and transfer learning benchmarks, and importantly, produces structured representations where the order of features indicates degrees of importance. We demonstrate using such representations as adaptive-length codes in image retrieval systems. By truncation according to feature importance, our method requires up to 16$\times$ shorter representation length than leading self-supervised learning methods to achieve similar retrieval performance. We further apply our method to graph data and report strong results on a node representation learning benchmark with more than one million nodes.
What problem does this paper attempt to address?