Abstract:Background and objective: Screening children for communicational disorders such as specific language impairment (SLI) is always challenging as it requires clinicians to follow a series of steps to evaluate the subjects. Artificial intelligence and computer-aided diagnosis have supported health professionals in making swift and error-free decisions about the neurodevelopmental state of children vis-à-vis language comprehension and production. Past studies have claimed that typical developing (TD) and SLI children show distinct vocal characteristics that can serve as discriminating facets between them. The objective of this study is to group children in SLI or TD categories by processing their raw speech signals using two proposed approaches: a customized convolutional neural network (CNN) model and a hybrid deep-learning framework where CNN is combined with long-short-term-memory (LSTM). Method: We considered a publicly available speech database of SLI and typical children of Czech accents for this study. The convolution filters in both the proposed CNN and hybrid models (CNN-LSTM) estimated fuzzy-automated features from the speech utterance. We performed the experiments in five separate sessions. Data augmentations were performed in each of those sessions to enhance the training strength. Results: Our hybrid model exhibited a perfect 100% accuracy and F-measure for almost all the session-trials compared to CNN alone which achieved an average accuracy close to 90% and F-measure ≥ 92%. The models have further illustrated their robust classification essences by securing values of reliability indexes over 90%. Conclusion: The results confirm the effectiveness of proposed approaches for the detection of SLI in children using their raw speech signals. Both the models do not require any dedicated feature extraction unit for their operations. The models may also be suitable for screening SLI and other neurodevelopmental disorders in children of different linguistic accents.

Infant Sound Classification on Multi-stage CNNs with Hybrid Features and Prior Knowledge.

Automatic Respiratory Sound Classification Via Multi-Branch Temporal Convolutional Network

Classification of Infant Cry Based on Hybrid Audio Features and ResLSTM

Convolutional Neural Networks for Audio-Based Continuous Infant Cry Monitoring at Home

Using Transfer Learning, SVM, and Ensemble Classification to Classify Baby Cries Based on Their Spectrogram Images.

Deep Learning for Asphyxiated Infant Cry Classification Based on Acoustic Features and Weighted Prosodic Features

Environmental Sound Classification Based on Multi-temporal Resolution Convolutional Neural Network Combining with Multi-level Features

Attention Feature Fusion Network via Knowledge Propagation for Automated Respiratory Sound Classification

Infant Cry Classification with Graph Convolutional Networks

A CNN-Transformer-ConvLSTM-CRF Hybrid Network for Sleep Stage Classification

Infant Vocal Tract Development Analysis and Diagnosis by Cry Signals with CNN Age Classification

Baby Cry Recognition by BCRNet Using Transfer Learning and Deep Feature Fusion

Generalized Camera-Based Infant Sleep-Wake Monitoring in NICUs: A Multi-Center Clinical Trial

A Hybrid Deep Learning Scheme for Multi-Channel Sleep Stage Classification

Multi-task Learning for Audio-based Infant Cry Detection and Reasoning

Classification of Infant Sleep/Wake States: Cross-Attention among Large Scale Pretrained Transformer Networks using Audio, ECG, and IMU Data

A Hybrid Neonatal Sleep Staging Method Based on Convolutional Neural Networks and Graph Neural Networks

A Semi-supervised Multi-scale Arbitrary Dilated Convolution Neural Network for Pediatric Sleep Staging

InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries

InfantNet: A Deep Neural Network for Analyzing Infant Vocalizations

One-dimensional convolutional neural network and hybrid deep-learning paradigm for classification of specific language impaired children using their speech