Fisher Discriminative Embedding Low-Rank Sparse Representation for Music Genre Classification
Xin Cai,Hongjuan Zhang
DOI: https://doi.org/10.1007/s00034-024-02696-0
IF: 2.311
2024-05-16
Circuits Systems and Signal Processing
Abstract:This work focuses on a music genre classification method based on a sparse low-rank representation. Sparse low-rank representation is an effective method for learning classifiers, which aims to learn a row-sparse low-rank representation matrix to effectively ignore noise and identify subspace structures in data contaminated by outliers. However, these related methods fail to utilize the discriminative information to mine the rich supervision information available in the training samples. To address this issue, a novel Fisher Discriminative Embedding Low-Rank Sparse Representation (FDLRSR) classification algorithm is proposed based on the Fisher criterion, which results in stronger intra-class similarity and inter-class separability representation coefficients. Meanwhile, its two special cases, i.e., the Fisher Discriminative Embedding Low-Rank Representation (FDLR) and Fisher Discriminative Embedding Sparse Representation (FDSR) are also presented in this work. Specifically, the proposed classification method employs the FDLRSR algorithm coupled with the feature combinations consisting acoustic features and spectral features for music genre classification tasks by minimizing the residuals. Compared with the several state-of-the-art music genre classification methods, the proposed methods substantially improve the classification results on three widely used datasets, the GTZAN, ISMIR2004 and Homburg datasets, with the highest classification accuracies of 97.9% and 99.43%, which verify its effectiveness and availability.
engineering, electrical & electronic