Efficient Identification Of Speakers In News Video Based On Shot Segmentation

Qing Chang,Xiangyang Xue,Hong Lu,You-san Nie
DOI: https://doi.org/10.1109/ICARCV.2004.1469078
2004-01-01
Abstract:An effective method for speaker identification in news video is presented in this paper, which is based on shot segmentation and exploits both audio and visual cues. Firstly, audio is segmented by shot segmentation based on the observation that there is only one speaker in a shot of news video in most cases. Furthermore, speech/non-speech discrimination is implemented on each shot. Finally, text-independent speaker identification is proposed using audio features on the discriminated speech shots. Experimental results show that our algorithm can obtain satisfactory performance in identifying speakers, so it can be used in real application.
What problem does this paper attempt to address?