Method for locating unlearned activities in video through image query

Zhao Zhou,Jiang Weihao,Zhang Zhu,Lin Zhijie,Song Jingkuan,Cai Deng,Chen Mosha,Qiu Wei
2019-01-01
Abstract:The invention discloses a method for locating unlearned activities in a video through image query. According to the method, a novel regional self-attention method is designed through relative positioncoding to learn regional representation of a fine-grained image, so that the influence of semantically unnecessary content in image query can be eliminated; a multi-layer stacked converter encoder isused, and multi-step fusion and reasoning of image and video contents are established, so that fuzzy positioning of inaccurate image query is processed; a sequence sensitive locator is used for directly retrieving the boundary of time, so that the boundary of a target fragment can be accurately determined. Compared with a general action positioning method, the method breaks through the limitationof predefined actions, and can be used for positioning unlearned activities in the video through image query. Compared with a traditional method, the method has the advantage that the effect obtainedin action positioning of the unmodified video is better.
What problem does this paper attempt to address?