Abstract: Early diagnosis of medical conditions in infants is crucial for ensuring timely and effective treatment. However, infants cannot verbalize their symptoms, which makes it difficult for healthcare professionals to diagnose their conditions accurately. Crying is often the only way for infants to communicate their needs and discomfort. In this paper, we propose a medical diagnostic system for interpreting infants' cry audio signals (CAS) using a combination of different audio domain features and deep learning (DL) algorithms. The proposed system utilizes a dataset of labeled audio signals from infants with specific pathologies. The dataset covers two infant pathologies with high mortality rates, neonatal respiratory distress syndrome (RDS) and sepsis, along with cries from healthy infants. The system employs the harmonic ratio (HR) as a prosodic feature, the Gammatone frequency cepstral coefficients (GFCCs) as a cepstral feature, and image-based features extracted from the spectrogram using a pretrained convolutional neural network (CNN). These features are fused so that the model benefits from multiple domains, improving its classification accuracy. Different combinations of the fused features are then fed into multiple machine learning algorithms, including random forest (RF), support vector machine (SVM), and deep neural network (DNN) models. Evaluation of the system using accuracy, precision, recall, F1-score, the confusion matrix, and the receiver operating characteristic (ROC) curve showed promising results for the early diagnosis of medical conditions in infants based on crying signals alone: the system achieved its highest accuracy, 97.50%, using the combination of the spectrogram, HR, and GFCC through the deep learning process.
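To make the prosodic feature concrete, the following is a minimal sketch of a harmonic ratio computed as the peak of the normalized autocorrelation over a range of pitch lags. This is one common autocorrelation-based definition; the abstract does not state the exact formula used, so the function name `harmonic_ratio`, the search range `f_min`/`f_max`, the frame length, and the sample rate are all illustrative assumptions rather than the authors' settings.

```python
import numpy as np


def harmonic_ratio(frame, sample_rate, f_min=200.0, f_max=800.0):
    """Peak of the normalized autocorrelation within a pitch-lag range.

    Assumption: f_min and f_max (Hz) are hypothetical bounds on the cry
    fundamental frequency; they are not taken from the paper.
    """
    x = np.asarray(frame, dtype=float)
    x = x - x.mean()                      # remove the DC offset
    n = x.size
    lag_min = max(1, int(sample_rate // f_max))
    lag_max = min(n - 1, int(sample_rate // f_min))
    best = 0.0
    for lag in range(lag_min, lag_max + 1):
        a, b = x[:n - lag], x[lag:]       # frame and its shifted copy
        denom = np.sqrt(np.dot(a, a) * np.dot(b, b))
        if denom > 0.0:
            best = max(best, float(np.dot(a, b) / denom))
    return best


# Synthetic check: a periodic tone should score near 1, white noise near 0.
sample_rate = 16000                       # assumed sample rate in Hz
t = np.arange(1024) / sample_rate         # one 1024-sample frame
tone = np.sin(2 * np.pi * 400.0 * t)
noise = np.random.default_rng(0).normal(size=1024)

hr_tone = harmonic_ratio(tone, sample_rate)
hr_noise = harmonic_ratio(noise, sample_rate)
print(f"HR tone: {hr_tone:.3f}, HR noise: {hr_noise:.3f}")
```

In a full pipeline this value would be computed per frame and summarized over the recording before fusion with the other features; that aggregation step is not described in the abstract and is left out here.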
The findings demonstrate the importance of fusing different audio features, especially the spectrogram, through the learning process rather than by simple concatenation, and of using deep learning algorithms to extract sparsely represented features for later use in classification, which improves the separation between different infant pathologies. The results outperformed the published benchmark paper by extending the classification problem to a multiclass one (RDS, sepsis, and healthy), investigating a new type of feature (the spectrogram), and using a new feature fusion technique: fusion through the learning process of the deep learning model.
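The contrast between simple concatenation and fusion through the learning process can be sketched at the level of tensor shapes. The snippet below is an untrained forward pass with random weights, intended only to show one plausible reading of the idea: each feature type passes through its own layer before a joint layer produces three-class probabilities. The feature dimensions (128, 13, 1), the layer widths, and all helper names are hypothetical; the abstract does not specify the architecture.

```python
import numpy as np

rng = np.random.default_rng(0)


def init(n_in, n_out):
    """Random weights and zero biases (untrained, for illustration only)."""
    return rng.normal(0.0, 0.1, size=(n_in, n_out)), np.zeros(n_out)


def dense_relu(x, params):
    w, b = params
    return np.maximum(x @ w + b, 0.0)


def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # subtract the row max for stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


batch = 4
spec = rng.normal(size=(batch, 128))  # assumed CNN embedding of the spectrogram
gfcc = rng.normal(size=(batch, 13))   # assumed GFCC vector
hr = rng.normal(size=(batch, 1))      # assumed harmonic ratio summary

# Simple concatenation: raw feature vectors are stacked side by side.
concat = np.concatenate([spec, gfcc, hr], axis=1)

# Fusion through learning: each branch has its own layer, and a joint
# layer combines the branch outputs before the classifier.
branch_spec, branch_gfcc, branch_hr = init(128, 16), init(13, 16), init(1, 16)
joint, out_w = init(48, 32), init(32, 3)

hidden = np.concatenate(
    [dense_relu(spec, branch_spec),
     dense_relu(gfcc, branch_gfcc),
     dense_relu(hr, branch_hr)],
    axis=1,
)
fused = dense_relu(hidden, joint)
probs = softmax(fused @ out_w[0] + out_w[1])  # classes: RDS, sepsis, healthy

print(concat.shape, fused.shape, probs.shape)
```

With trained weights, the branch layers would let the network rescale and reweight each feature domain before combining them, whereas plain concatenation hands unbalanced raw vectors to the classifier.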

Real-Time Multi-Level Neonatal Heart and Lung Sound Quality Assessment for Telehealth Applications

Real-time Neonatal Chest Sound Separation using Deep Learning

A New Non-Negative Matrix Co-Factorisation Approach for Noisy Neonatal Chest Sound Separation

Prediction of Neonatal Respiratory Distress in Term Babies at Birth from Digital Stethoscope Recorded Chest Sounds

Deep learning based non-contact physiological monitoring in Neonatal Intensive Care Unit

Point-of-Care Real-Time Signal Quality for Fetal Doppler Ultrasound Using a Deep Learning Approach

A Deep Learning Approach for the Assessment of Signal Quality of Non-Invasive Foetal Electrocardiography

Unsupervised Learning-Based Non-Invasive Fetal ECG Muti-Level Signal Quality Assessment

Improving Robustness and Clinical Applicability of Automatic Respiratory Sound Classification Using Deep Learning-Based Audio Enhancement: Algorithm Development and Validation Study

A cry for help: Early detection of brain injury in newborns

Energy-Efficient Respiratory Anomaly Detection in Premature Newborn Infants

Multi-task cascaded assessment of signal quality for long-term single-lead ECG monitoring

Bedside Monitoring Tools and Advanced Signal Processing Approaches to Monitor Critically-ill Infants

System Level Framework for Assessing the Accuracy of Neonatal EEG Acquisition

Camera-based Cardiorespiratory Monitoring of Preterm Infants in NICU

Deep learning in the ultrasound evaluation of neonatal respiratory status

Contrasting Deep Learning Models for Direct Respiratory Insufficiency Detection Versus Blood Oxygen Saturation Estimation

Infant Cry Signal Diagnostic System Using Deep Learning and Fused Features

Contactless radar-based breathing monitoring of premature infants in the neonatal intensive care unit

Practical implementation of artificial intelligence algorithms in pulmonary auscultation examination

Deep Neural Network for Respiratory Sound Classification in Wearable Devices Enabled by Patient Specific Model Tuning