Human Fixation Detection Model in Video Compressed Domain Based on Markov Random Field

Yongjun Li,Yunsong Li,Weijia Liu,Jing Hu,Chiru Ge
DOI: https://doi.org/10.1117/1.jei.26.1.013008
IF: 0.829
2017-01-01
Journal of Electronic Imaging
Abstract:Recently, research on and applications of human fixation detection in video compressed domain have gained increasing attention. However, prediction accuracy and computational complexity still remain a challenge. This paper addresses the problem of compressed domain fixations detection in the videos based on residual discrete cosine transform coefficients norm (RDCN) and Markov random field (MRF). RDCN feature is directly extracted from the compressed video with partial decoding and is normalized. After spatial–temporal filtering, the normalized map [Smoothed RDCN (SRDCN) map] is taken to the MRF model, and the optimal binary label map is obtained. Based on the label map and the center saliency map, saliency enhancement and nonsaliency inhibition are done for the SRDCN map, and the final SRDCN-MRF salient map is obtained. Compared with the similar models, we enhance the available energy functions and introduce an energy function that indicates the positional information of the saliency. The procedure is advantageous for improving prediction accuracy and reducing computational complexity. The validation and comparison are made by several accuracy metrics on two ground truth datasets. Experimental results show that the proposed saliency detection model achieves superior performances over several state-of-the-art compressed-domain and pixel-domain algorithms on evaluation metrics. Computationally, our algorithm reduces 26% more computational complexity with comparison to similar algorithms.
What problem does this paper attempt to address?