A Robust Approach for Scene Text Detection and Tracking in Video.

Yang Wang, Lan Wang, Feng Su
DOI: https://doi.org/10.1007/978-3-030-00764-5_28
2018-01-01
Abstract: The detection of scene text in videos is of great value in content-based video applications such as video analysis and retrieval. In this paper, we present a robust scene text detection and tracking method for videos. We first propose an effective deep neural network model for detecting text in individual video frames, which enhances the EAST model by introducing deconvolution layers and inception modules. We then present a correlation-filter-based tracking algorithm for text in the video and combine the detection and tracking results, which effectively improves the final video text detection performance. The proposed method outperforms other state-of-the-art methods in experiments on public scene text video datasets.
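The abstract does not say how detection and tracking results are combined. As a rough illustration only, and not the authors' method, one common approach is to keep every per-frame detection and add any tracked box that no detection overlaps, so that text missed by the detector in a given frame is recovered from the tracker. The sketch below assumes axis-aligned boxes; the names `iou` and `fuse` and the 0.5 overlap threshold are hypothetical choices made for this example.

```python
from typing import List, Tuple

# Hypothetical box format for this sketch: (x1, y1, x2, y2), axis-aligned.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def fuse(detections: List[Box], tracked: List[Box], thresh: float = 0.5) -> List[Box]:
    """Keep all detections; add tracked boxes that overlap no detection.

    The threshold is an assumed value, not one taken from the paper.
    """
    fused = list(detections)
    for t in tracked:
        if all(iou(t, d) < thresh for d in detections):
            fused.append(t)
    return fused


if __name__ == "__main__":
    dets = [(0, 0, 10, 10)]
    trks = [(1, 1, 11, 11), (50, 50, 60, 60)]
    # The first tracked box duplicates the detection; the second is kept.
    print(fuse(dets, trks))
```

In this sketch a tracked box counts as a duplicate when its IoU with any detection reaches the threshold. Real scene text is often rotated, so a practical system would likely need quadrilateral or rotated-box overlap instead.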