Batch Mode Active Learning for Networked Data with Optimal Subset Selection

Haihui Xu,Pengpeng Zhao,Victor S. Sheng,Guanfeng Liu,Lei Zhao,Jian Wu,Zhiming Cui
DOI: https://doi.org/10.1007/978-3-319-21042-1_8
2015-01-01
Abstract:Active learning has increasingly become an important paradigm for classification of networked data, where instances are connected with a set of links to form a network. In this paper, we propose a novel batch mode active learning method for networked data (BMALNeT). Our novel active learning method selects the best subset of instances from the unlabeled set based on the correlation matrix that we construct from the dedicated informativeness evaluation of each unlabeled instance. To evaluate the informativeness of each unlabeled instance accurately, we simultaneously exploit content information and the network structure to capture the uncertainty and representativeness of each instance and the disparity between any two instances. Compared with state-of-the-art methods, our experimental results on three real-world datasets demonstrate the effectiveness of our proposed method.
What problem does this paper attempt to address?