Consistent Panoramic Video Style Transfer Via Temporal-Spatial Cross Perception

Weiyu Wang,Chunmei Qing,Junpeng Tan,Xiangmin Xu
DOI: https://doi.org/10.1007/978-981-97-5597-4_23
2024-01-01
Abstract:Due to the hyper-view structure of panoramic video and the flicker in stylized video, the temporal-spatial consistency of over-the-horizon should be considered when transferring panoramic video style. To this end, we propose a novel Temporal-Spatial Cross Perception Style Transfer Network (TSCPNet), which considers the inherent characteristics of panoramic video and temporal coherence to enhance the robustness of inter-frame variation. Specifically, based on the cross complementation structure, multiple branches of the continuous dilated convolution are designed to improve the spatial structure correlation in the over-the-horizon. To capture richer stylistic attributes and realism of rendering, we propose a multi-perception semantic fusion module that performs local adaptation and global statistical distribution semantic fusion. Besides, to better balance the temporal-spatial consistency of stylized panoramic videos, we propose a weighted panorama structural similarity loss in the training step. Qualitative and quantitative evaluations show that TSCPNet performs well in panoramic video and panoramic image style transfer. More results and details can be found on https://weiyang001. github.io/TSCPNet/.
What problem does this paper attempt to address?