DMVOS: Discriminative Matching for Real-time Video Object Segmentation

Peisong Wen, Ruolin Yang, Qianqian Xu, Chen Qian, Qingming Huang, Runmin Cong, Jianlou Si
DOI: https://doi.org/10.1145/3394171.3414035
2020-01-01
Abstract: Although recent semi-supervised video object segmentation (VOS) methods have appreciably improved segmentation accuracy, it remains difficult to strike an adequate speed-accuracy balance in real-world application scenarios. In this work, we propose Discriminative Matching for real-time Video Object Segmentation (DMVOS), a real-time, high-accuracy VOS framework that fills this gap. Building on the matching mechanism, our framework introduces discriminative information through the Isometric Correlation module and the Instance Center Offset module. Specifically, the Isometric Correlation module learns a pixel-level similarity map with semantic discriminability, and the Instance Center Offset module exploits instance-level spatial discriminability. Experiments on two benchmark datasets show that our model achieves state-of-the-art performance at very high speed, for example, a J&F of 87.8% on the DAVIS-2016 validation set at 35 milliseconds per frame.
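To make the idea of a pixel-level similarity map concrete, the following is a minimal sketch of generic matching-based VOS, not the paper's Isometric Correlation module, whose design the abstract does not specify. It assumes per-pixel feature maps for a reference frame and a query frame are already available, and it scores each query pixel by its best cosine similarity to any foreground pixel of the reference mask. The function name `pixel_similarity_map` and all of its parameters are hypothetical.

```python
import numpy as np


def pixel_similarity_map(ref_feat, ref_mask, query_feat):
    """Generic matching sketch (an assumption, not the DMVOS module).

    ref_feat, query_feat: float arrays of shape (C, H, W).
    ref_mask: boolean array of shape (H, W), True on the target object.
    Returns an (H, W) map holding, for each query pixel, the highest
    cosine similarity to any foreground pixel of the reference frame.
    """
    c, h, w = query_feat.shape

    # Flatten the spatial dimensions so each column is one pixel embedding.
    ref = ref_feat.reshape(c, -1)
    qry = query_feat.reshape(c, -1)

    # L2-normalise each embedding so dot products become cosine similarities.
    ref = ref / (np.linalg.norm(ref, axis=0, keepdims=True) + 1e-8)
    qry = qry / (np.linalg.norm(qry, axis=0, keepdims=True) + 1e-8)

    # Keep only the reference embeddings that lie on the object.
    fg = ref[:, ref_mask.reshape(-1)]
    if fg.shape[1] == 0:
        # No foreground in the reference mask: nothing to match against.
        return np.zeros((h, w))

    # (N_fg, H*W) similarity matrix, reduced to the best match per query pixel.
    sim = fg.T @ qry
    return sim.max(axis=0).reshape(h, w)
```

Thresholding such a map gives only a crude foreground estimate. A plain similarity map of this kind carries no instance-level spatial cue, which is the sort of information the abstract attributes to the Instance Center Offset module.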