MLN: Moment localization Network and Samples Selection for Moment Retrieval

Bo Huang, Ya Zhang, Kai Yu
DOI: https://doi.org/10.1145/3301506.3301538
2018-01-01
Abstract: Moment retrieval has recently attracted considerable attention. Given a video, the goal is to retrieve the clip that matches a given semantic meaning. The problem is difficult because it requires both video understanding and language understanding and, as a cross-modal retrieval problem, it depends heavily on the design of the cross-modal method and the similarity metric. Previous research established a general framework for moment retrieval. In this paper, we refine that framework, name it the moment localization network (MLN), and propose two novel sample selection methods to improve model training. We conduct experiments on two large datasets, TACoS and DiDeMo. The results show that our approach outperforms the previous state-of-the-art method and that our sample selection methods improve performance.