Artificial Intelligence-Based Video Saliency Prediction: Challenges and Trends

Jiongzhi Lin,Baitao Huang,Fei Zhou
DOI: https://doi.org/10.1109/mis.2024.3482950
IF: 6.744
2024-12-07
IEEE Intelligent Systems
Abstract:Video saliency prediction (VSP) aims to identify regions in videos that attract human attention and gaze. In the past, researchers have conducted extensive studies on VSP, establishing various video saliency datasets and prediction models. Leveraging the powerful end-to-end learning capabilities of deep learning techniques and the availability of large-scale video saliency datasets, the performance of saliency prediction models has significantly improved. Today, with the development of multimedia technologies, the task of VSP has generated numbers of promising directions, such as high dynamic range VSP and audio VSP, among others. This article focuses on the challenges of VSP in the context of multimedia technologies; reviews the research on video saliency, including video saliency datasets and prediction models; and then introduces potential research directions in conjunction with contemporary multimedia technologies.
computer science, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?