Learning Clustering for Motion Segmentation

Xun Xu,Le Zhang,Loong-Fah Cheong,Zhuwen Li,Ce Zhu
DOI: https://doi.org/10.1109/tcsvt.2021.3069094
IF: 5.859
2022-03-01
IEEE Transactions on Circuits and Systems for Video Technology
Abstract:Subspace clustering has been extensively studied from the hypothesis-and-test, algebraic, and spectral clustering-based perspectives. Most assume that only a single type/class of subspace is present. Generalizations to multiple types are non-trivial, plagued by challenges such as choice of types and numbers of models, sampling imbalance and parameter tuning. In many real world problems, data may not lie perfectly on a linear subspace and hand designed linear subspace models may not fit into these situations. In this work, we formulate the multi-type subspace clustering problem as one of learning non-linear subspace filters via deep multi-layer perceptrons (mlps). The response to the learnt subspace filters serve as the feature embedding that is clustering-friendly, i.e., points of the same clusters will be embedded closer together through the network. For inference, we apply K-means to the network output to cluster the data. Experiments are carried out on synthetic data and real world motion segmentation problems, producing state-of-the-art results.
engineering, electrical & electronic
What problem does this paper attempt to address?