VCSOD: A Video Conversion Scheme Based on Salient Object Detection Algorithm

Lin Li,Yaoyao Yin,Xiaojun Zhou,Yuhua Qiu,Xingxing Cheng,Li Song
DOI: https://doi.org/10.1007/978-981-99-0923-0_58
2023-01-01
Abstract:Short videos on mobile phones are becoming more and more popular. How to convert horizontal screen videos into high-quality vertical screen videos in batches has become a difficult point in production. In this paper, we propose a video conversion scheme based on salient object detection algorithm. A 3D fully-convolutional network architecture is employed in the detection of consecutive frames. A lightweight image salient detection model is used for single-frame detection. By filtering and clustering the target coordinate points detected by the continuous frame and single frame respectively, the key coordinate value of the salient object in the continuous frame is obtained. Use these key coordinate points to smoothly synthesize vertical video. This solution can be applied to industrial production on a large scale, saving labor costs and mass-producing higher-quality vertical-screen videos.
What problem does this paper attempt to address?