Quantitative analysis of automatic voice disorder detection studies for hybrid feature and classifier selection

Jong Bub Lee,Hyun Gyu Lee
DOI: https://doi.org/10.1016/j.bspc.2024.106014
IF: 5.1
2024-02-08
Biomedical Signal Processing and Control
Abstract:Owing to the development of machine learning, particularly deep learning, researchers have focused on automatic voice-disorder detection. However, voice-disorder datasets vary significantly in terms of the number of patients per disorder, and different conditions are targeted in different studies. Therefore, conducting direct comparisons of performances across related studies is complicated. Hence, we compare conventional machine learning, deep learning, and multimodal methods by establishing a fixed dataset and an evaluation pipeline using the Saarbrücken voice database, which is the most commonly used database for automatic voice-disorder detection. In addition, we propose an automatic voice-disorder detection method that combines features and classifiers. Experimental results show mean unweighted average recall differences of 8% and 15% on the abovementioned two datasets, respectively, and that the proposed combination improves them by 1.5% and 0.5%, respectively.
engineering, biomedical
What problem does this paper attempt to address?