Abstract:Parkinson's disease (PD), a neurological disorder renowned for its impact on hearing and movement, has become a focal point in recent scientific research. Timely and accurate detection of Parkinson's disease is crucial for effective intervention and treatment. Speech impairment serves as a common early symptom, emphasizing the significance of vocal disorders in PD diagnosis systems. Deep learning models show promise in accurately diagnosing Parkinson's disease, utilizing vocal signal processing and acoustic recognition techniques. However, a persistent challenge lies in selecting relevant acoustic features, crucial for improving accuracy and reducing computational complexity. This paper presents a novel Hybrid Ensemble Feature Selection (HEFS) Framework designed for a PD detection system based on a Deep Neural Network (DNN). The framework incorporates a multi-stage EFS process, starting with the utilization of three Feature Selection techniques—ReliefF, mRMR, and Chi-square—to individually extract relevant features from the dataset. Subsequently, Multiple Layers Dimensionality Reduction (MLDR) is applied to matrices derived from this HEFS. The MLDR process encompasses the normalization of matrix scores, score combinations, reconstruction of the new dataset, and, ultimately, feature reduction using the Neighborhood Component Analysis (NCA) algorithm. The proposed model in the study exhibits notable advancements in PD classification based on vocal features. The model achieved an impressive accuracy rate of 97.08% and an F1-score of 98.10%. Comparative analysis with state-of-the-art diagnostic methods reveals the superiority of the proposed model, surpassing recent techniques like CNN, Bi-LSTM, and Multi-Kernel SVM in accuracy. The model's AUC value of 0.98 further supports its excellence in classification performance compared to the original dataset (AUC = 0.90). Overall, the research significantly contributes to PD diagnosis by presenting a powerful approach that combines the HEFS-MLDR technique with a DNN model. This study successfully integrates the HEFS-MLDR technique with a DNN model, creating a robust and efficient framework. The innovative hybrid architecture, incorporating EFS technique and a dimensionality reduction concept, produces high-performing classification tools for PD. The achieved high accuracy and F1-score highlight the potential of this methodology for early and accurate PD detection.

Tran-DSR: A hybrid model for dysarthric speech recognition using transformer encoder and ensemble learning

UTran-DSR: a novel transformer-based model using feature enhancement for dysarthric speech recognition

Enhancing dysarthric speech recognition through SepFormer and hierarchical attention network models with multistage transfer learning

A hybrid model for pathological voice recognition of post-stroke dysarthria by using 1DCNN and double-LSTM networks

Post-Stroke Dysarthria Voice Recognition based on Fusion Feature MSA and 1D

Speaker-Independent Dysarthria Severity Classification using Self-Supervised Transformers and Multi-Task Learning

Deep neural network architectures for dysarthric speech analysis and recognition

Voice disorder classification using speech enhancement and deep learning models

Detecting Dementia from Speech and Transcripts using Transformers

Automatic cross‐ and multi‐lingual recognition of dysphonia by ensemble classification using deep speaker embedding models

Residual Convolutional Neural Network-Based Dysarthric Speech Recognition

Exploring Self-supervised Pre-trained ASR Models For Dysarthric and Elderly Speech Recognition

Transformer-based transfer learning on self-reported voice recordings for Parkinson's disease diagnosis

TranStutter: A Convolution-Free Transformer-Based Deep Learning Method to Classify Stuttered Speech Using 2D Mel-Spectrogram Visualization and Attention-Based Feature Representation

HEFS-MLDR: A novel hybrid ensemble feature selection framework for improved deep neural network architecture in the diagnosis of Parkinson's disease

Improving Dysarthric Speech Segmentation With Emulated and Synthetic Augmentation

Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction

A deep learning approach to dysarthric utterance classification with BiLSTM-GRU, speech cue filtering, and log mel spectrograms

Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment

A multi-stage transfer learning strategy for diagnosing a class of rare laryngeal movement disorders

Pre-trained models for detection and severity level classification of dysarthria from speech