Detection Algorithm for Two-Person Conversations

Ke LI,Jia LIU
DOI: https://doi.org/10.3321/j.issn:1000-0054.2007.01.018
2007-01-01
Abstract:An algorithm is given to detect and track speakers in two-person telephone conversations.The approach uses a Gaussian mixture model with a universal background model(GMM-UBM) of speaker detection system as the core speaker recognition engine.The segmentation algorithm is based on the sum of the long-term distance and short-term distance measures,with an improved clustering process.Experiments on the NIST'99 evaluation database show that the detection system based on the segmentation algorithm provides good performance with an EER of 15.1%.
What problem does this paper attempt to address?