Large-Scale Graph-Based Semi-Supervised Learning Via Tree Laplacian Solver

Yan-Ming Zhang,Xu-Yao Zhang,Xiao-Tong Yuan,Cheng-Lin Liu
DOI: https://doi.org/10.1609/aaai.v30i1.10218
2016-01-01
Abstract:Graph-based Semi-Supervised learning is one of the most popular and successful semi-supervised learning methods. Typically, it predicts the labels of unlabeled data by minimizing a quadratic objective induced by the graph, which is unfortunately a procedure of polynomial complexity in the sample size n. In this paper, we address this scalability issue by proposing a method that approximately solves the quadratic objective in nearly linear time. The method consists of two steps: it first approximates a graph by a minimum spanning tree, and then solves the tree-induced quadratic objective function in O(n) time which is the main contribution of this work. Extensive experiments show the significant scalability improvement over existing scalable semi-supervised learning methods.
What problem does this paper attempt to address?