Automatic Music Genre Classification Based on Auditory Images

Li Qiang, Li Qiuying, Guan Xin
2013-01-01
Abstract: Automatic music genre classification is an important part of music information retrieval systems. This paper introduces the concept of the "auditory image" into music genre classification. The auditory image model (AIM) simulates the cochlear structure of the human ear to convert one-dimensional audio signals from the commonly used GTZAN database into two-dimensional auditory images. The scale-invariant feature transform (SIFT) and spatial pyramid matching (SPM) are then used to extract image features from the local level to the global level. Because the resulting features are high-dimensional, a linear-kernel support vector machine is chosen for classification. Experimental results show that the genre classification accuracy based on auditory images can be 15% higher than that obtained with Mel-frequency cepstral coefficients (MFCC), which are also based on the cochlear structure of the human ear.
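
The "from the local level to the global level" step can be illustrated with a minimal sketch of spatial pyramid pooling. It is not the authors' implementation: the abstract gives no parameters, so the function name `spatial_pyramid_histogram`, the vocabulary size, and the number of pyramid levels are all assumptions made for illustration. The sketch also assumes that SIFT descriptors have already been extracted from an auditory image and quantized to visual-word indices, and it omits the per-level weights used in some SPM formulations.

```python
def spatial_pyramid_histogram(points, words, vocab_size, levels=2):
    """Concatenate visual-word histograms over a pyramid of grids.

    points: list of (x, y) keypoint positions, normalized to [0, 1].
    words:  list of visual-word indices in range(vocab_size), one per point.
    (Hypothetical helper; names and defaults are illustrative assumptions.)
    """
    feature = []
    for level in range(levels + 1):
        cells = 2 ** level  # the grid is cells x cells at this level
        hist = [0] * (cells * cells * vocab_size)
        for (x, y), w in zip(points, words):
            # Clamp so that a coordinate of exactly 1.0 falls in the last cell.
            cx = min(int(x * cells), cells - 1)
            cy = min(int(y * cells), cells - 1)
            hist[(cy * cells + cx) * vocab_size + w] += 1
        feature.extend(hist)
    return feature


# Toy input: three descriptors, already quantized to a 4-word vocabulary.
points = [(0.1, 0.1), (0.9, 0.1), (0.6, 0.7)]
words = [0, 2, 2]
feature = spatial_pyramid_histogram(points, words, vocab_size=4, levels=2)
print(len(feature))  # 4 * (1 + 4 + 16) = 84
```

The feature length grows as vocab_size * (1 + 4 + 16 + ...), so even a modest vocabulary yields a long vector. This is consistent with the abstract's stated reason for choosing a linear-kernel support vector machine.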