3D Pop-Ups: Omnidirectional image visual saliency prediction based on crowdsourced eye-tracking data in VR

Shiwei Cheng,Qi Lu,Zepeng Shen,Yang Liu,Yuejiang Hao,Ting Han
DOI: https://doi.org/10.1016/j.displa.2024.102746
IF: 3.074
2024-07-01
Displays
Abstract:The prediction of the visual saliency of omnidirectional images in VR is valuable for understanding visual behaviors. However, the equipment cost, software setups, hardware operation, and other constraints in acquiring eye-tracking data of omnidirectional images for visual saliency prediction would lead to a low training efficiency and prediction performance. Therefore, this paper proposed a crowdsourcing method based on recall fixations, which was used to collect and construct an omnidirectional image with eye-tracking dataset called CrowdSourcing360, which contained 16,200 pieces of data on 180 images. Using this dataset, a visual saliency prediction model CSnet360 was trained. Experiments demonstrated that the visual saliency prediction performance of the CSnet360 outperformed most existing models even without using actual gaze fixations. Finally, a VR interior design assistance prototype system was built and the preliminary study results indicated that the system could help designers to improve the quality of their design solutions.
engineering, electrical & electronic,instruments & instrumentation,optics,computer science, hardware & architecture
What problem does this paper attempt to address?