Weakly Supervised Random Forest for Multi-Label Image Clustering and Segmentation

Yingjie Xia, Qianqian Zhu, Wei Wei
DOI: https://doi.org/10.1145/2671188.2749377
2015-01-01
Abstract: Clustering is a useful statistical tool in data mining and computer vision. Supervised information can be introduced to improve clustering performance. However, accurately labeling each piece of data is extremely expensive when the amount of data is huge. Existing supervised clustering methods handle the heavy workload of labeling large amounts of data by transferring bag-level labels into instance-level descriptors. However, allowing each bag only one label seriously limits their scope of application. In this paper, we propose weakly supervised multi-label clustering, which allows a bag of data to be assigned multiple labels. The key technique is a weakly supervised random forest that computes the model parameters using a deterministic annealing strategy to optimize the non-convex objective function. The proposed algorithm is applied to two typical applications: image clustering and image segmentation. In our experiments, it achieves impressive efficiency in both the training and testing stages on state-of-the-art image data sets.
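To make the idea of deterministic annealing under bag-level multi-labels concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a simple per-label centroid model in place of the random forest, and the names `soft_assign` and `anneal_bag_labels`, along with the temperature schedule parameters, are hypothetical. Each instance may only take a label that appears in its bag; soft assignments are sharpened as the temperature is lowered, which is the usual way annealing is used to avoid poor local optima of a non-convex objective.

```python
import math


def soft_assign(point, centroids, allowed, temperature):
    """Posterior over the bag's allowed labels at the given temperature."""
    d2 = {k: sum((a - b) ** 2 for a, b in zip(point, centroids[k]))
          for k in allowed}
    d_min = min(d2.values())  # shift for numerical stability
    w = {k: math.exp(-(v - d_min) / temperature) for k, v in d2.items()}
    z = sum(w.values())
    return {k: v / z for k, v in w.items()}


def anneal_bag_labels(bags, n_labels, t_start=4.0, t_end=0.05,
                      cooling=0.7, sweeps=5):
    """bags: list of (points, allowed_labels); returns hard labels per bag.

    Assumption: every label occurs in at least one bag, and a centroid
    model stands in for the paper's random forest.
    """
    dim = len(bags[0][0][0])
    # Initialise each centroid as the mean of all points whose bag has the label.
    centroids = []
    for k in range(n_labels):
        pts = [p for points, allowed in bags if k in allowed for p in points]
        centroids.append(tuple(sum(p[d] for p in pts) / len(pts)
                               for d in range(dim)))

    t = t_start
    while t > t_end:
        for _ in range(sweeps):
            sums = [[0.0] * dim for _ in range(n_labels)]
            weights = [0.0] * n_labels
            for points, allowed in bags:
                for p in points:
                    for k, w in soft_assign(p, centroids, allowed, t).items():
                        weights[k] += w
                        for d in range(dim):
                            sums[k][d] += w * p[d]
            # Weighted-mean update; keep the old centroid if it got no weight.
            centroids = [
                tuple(s / weights[k] for s in sums[k]) if weights[k] > 0
                else centroids[k]
                for k in range(n_labels)
            ]
        t *= cooling  # lower the temperature so assignments harden

    # Final hard assignment: nearest centroid among the bag's allowed labels.
    result = []
    for points, allowed in bags:
        labels = []
        for p in points:
            labels.append(min(
                allowed,
                key=lambda k: sum((a - b) ** 2
                                  for a, b in zip(p, centroids[k]))))
        result.append(labels)
    return result
```

In this sketch a bag carrying several labels contributes to all of their centroids at high temperature, and single-label bags act as anchors that pull each centroid toward the right instances as the temperature drops.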