Ensemble Model-Based Singer Classification with Proposed Vocal Segmentation

Balachandra Kumaraswamy
DOI: https://doi.org/10.1007/s11277-024-10928-4
IF: 2.017
2024-04-13
Wireless Personal Communications
Abstract:Music information retrieval (MIR) is a major topic in the domain of music retrieval and indexing. SID and categorization is a subset of MIR that may be applied to a variety of problems. People have gotten increasingly attached to music, searching for songs based on a genre or a specific performer. Songs with singer information in their tags may be readily filtered, while others cannot be identified devoid of listening. However, the majority of the songs were left out of the specifics. This paper mainly aims to propose the new singer classification model, where, pre-processing is initially done. Further, vocal segmentation is done using time domain filtering (TDF) and frequency domain filtering (FDF) and improved short-time Fourier transform (STFT). Further, timbre features, short-term energy (STE), mel-frequency cepstral coefficients (MFCCs) and improved vibrato estimation features are extracted that are given to EC that includes deep convolutional neural network (DCNN), bidirectional long short-term memory (LSTM) and deep belief network (DBN). Additionally, the final result is derived by averaging each ensemble classifier's efficiency.
telecommunications
What problem does this paper attempt to address?