A Visual Attention Model Based On The Compressed Domain

Yiwei Jiang, De Xu
2006-01-01
Abstract: Visual attention allows primates to quickly select salient regions of an image. Over the past decades, a number of computable models of visual attention have been developed, but most of these models are based on the pixel domain. Little theoretical or computational work on visual attention is based on the compressed domain. In this paper, a visual attention model in the discrete cosine transform (DCT) domain is proposed. For each I frame, we utilize the characteristics of the DCT coefficients to get low-level features of the frame and combine those feature maps into a unique saliency map. For each P frame, we use the motion vectors and the saliency map of its reference frame to construct the saliency map. We present evidence for the validity of our method with several MPEG-4 video clips. The experimental results are presented to demonstrate the efficiency and accuracy of our proposed algorithm.
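The abstract does not specify which features are extracted or how they are combined, so the following Python sketch only illustrates the general two-step idea under stated assumptions; it is not the authors' algorithm. It assumes the I frame is available as an array of 8x8 DCT blocks, uses the DC coefficient and the summed AC magnitude as stand-in low-level features, and scores each block by its contrast with its 3x3 neighborhood. For a P frame it assumes block-unit motion vectors pointing into the reference frame and blends the propagated reference saliency with motion magnitude. All function names and the `motion_weight` parameter are hypothetical.

```python
import numpy as np

def normalize(m):
    """Scale a map to [0, 1]; a constant map becomes all zeros."""
    m = np.asarray(m, dtype=float)
    span = m.max() - m.min()
    return (m - m.min()) / span if span > 0 else np.zeros_like(m)

def local_contrast(m):
    """Absolute difference between each block and its 3x3 neighborhood mean."""
    m = np.asarray(m, dtype=float)
    h, w = m.shape
    p = np.pad(m, 1, mode="edge")
    neigh = sum(p[i:i + h, j:j + w] for i in range(3) for j in range(3)) / 9.0
    return np.abs(m - neigh)

def i_frame_saliency(dct_blocks):
    """dct_blocks: (rows, cols, 8, 8) array of per-block DCT coefficients."""
    dc = dct_blocks[:, :, 0, 0]  # proportional to block mean intensity
    # Assumed texture/edge proxy: total AC magnitude of the block.
    ac_energy = np.abs(dct_blocks).sum(axis=(2, 3)) - np.abs(dc)
    maps = [normalize(local_contrast(dc)), normalize(local_contrast(ac_energy))]
    return normalize(sum(maps) / len(maps))

def p_frame_saliency(ref_saliency, motion_vectors, motion_weight=0.5):
    """motion_vectors: (rows, cols, 2) array of (dy, dx) offsets, in block
    units, from each block to its matching block in the reference frame."""
    mv = np.rint(np.asarray(motion_vectors)).astype(int)
    rows, cols = ref_saliency.shape
    r, c = np.indices((rows, cols))
    src_r = np.clip(r + mv[:, :, 0], 0, rows - 1)
    src_c = np.clip(c + mv[:, :, 1], 0, cols - 1)
    propagated = ref_saliency[src_r, src_c]  # saliency carried along motion
    magnitude = normalize(np.hypot(mv[:, :, 0], mv[:, :, 1]))
    return normalize((1 - motion_weight) * propagated + motion_weight * magnitude)
```

Because both steps operate on one value per block rather than per pixel, the maps are 64 times smaller than the decoded frame, which is the kind of saving a compressed-domain approach aims for.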