GMM Based Intelligent Detection of the Vocal Segment in Popular Music

李丽娟,叶茂,赵欣
2009-01-01
Abstract:Effective detection of vocal segment in popular music is very valuable in many applications.such as music retrieval.browsing.and cataloguing in a large database.melody extraction and singer recognition.The feature vector used to analyze the music signal in this paper is Mel-Frequency Cepstral Coeffcient which is generally used in speech signal processing.The methods of training GMM are applied to create the corresponding GMM for the non-vocal and vocal in a music respectively.which can achieve the goal of intelligent detection of vocal segment.In contrast to the conventional GMM training methods.using one group of training data which are hand-labeled as non-vocal and vocal to create one model for each class.On the other hand,in this paper.another GMM is created for each class using a group of pure vocal data and a group of pure non-vocal data respectively.And the probability models are obtained through the means of linear combination of the two GMMs of each class.The decision function used in this paper is likelihood probability classifier.The experiment results show that this method can improve the performance of the detection.
What problem does this paper attempt to address?