Automatic Face Segmentation Using Color Cues for Coding Typical Videophone Scenes

YJ Zhang,YR Yao,Y He
DOI: https://doi.org/10.1117/12.263258
1997-01-01
Abstract:This paper presents a simple color segmentation technique which could be used in the model-based very low bit-rate coding approaches for videophone applications, in which the delimitation of the face of speaker is request. This work attempts to segment the face of speaker using color cues. To better take the advantage of the color contents of images, the color segmentation is carried out in HSI (Hue, Saturation, Intensity) space with the three components used in two steps. The original image is first splitted into two groups of regions, one has higher saturation values and other has lower saturation values, by using an adaptive threshold value applied to the histogram of saturation. In the high saturation regions, the hue component can furnish useful references for further segmentation, while in the low saturation regions the intensity component can play the similar role. For each group of regions, a multi-thresholding technique based on either hue or intensity component is then proposed for the subsequent segmentation. After both groups of regions are segmented, a combination of these two segmentation results will provide the finally segmented image. Some experiments with images taken from typical ''head-and-shoulders'' videophone sequences are carried out and some results are presented.
What problem does this paper attempt to address?