Enhancing ASD detection accuracy: a combined approach of machine learning and deep learning models with natural language processing

Sergio Rubio-Martín, María Teresa García-Ordás, Martín Bayón-Gutiérrez, Natalia Prieto-Fernández, José Alberto Benítez-Andrades
DOI: https://doi.org/10.1007/s13755-024-00281-y
2024-03-06
Abstract: Purpose: Our study explored the use of artificial intelligence (AI) to diagnose autism spectrum disorder (ASD). It focused on machine learning (ML) and deep learning (DL) to detect ASD from text inputs on social media, addressing challenges in traditional ASD diagnosis.
Computation and Language, Machine Learning
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve

The paper explores the application of Artificial Intelligence (AI) to diagnosing Autism Spectrum Disorder (ASD). Specifically, it uses Machine Learning (ML) and Deep Learning (DL) models to detect potential ASD cases from text inputs, particularly posts on social media platforms such as Twitter. The goal is to ease current challenges in ASD diagnosis, such as the need for specialized medical personnel and substantial resources. Timely identification, especially in children, is crucial for providing early intervention and support, and thereby for improving the quality of life of affected individuals.

### Main Contributions

1. **Additional Machine Learning Models**: In addition to the models explored in the preliminary research, other models, particularly the K-Nearest Neighbors (KNN) algorithm, were trained and evaluated. Introducing and optimizing these models required significant time and computational resources.
2. **Extended Deep Learning Models**: Deep learning models were explored further, including Recurrent Neural Networks (RNN) and Long Short-Term Memory networks (LSTM), which are well suited to sequential data such as text.
3. **Pre-trained BERT Models**: Two pre-trained BERT models were tested. Each was pre-trained on a different dataset and has its own characteristics, which affects its performance on the task.

### Datasets and Methods

- **Dataset Collection and Classification**: 404,627 tweets were collected from Twitter and divided into two groups: tweets by users who explicitly stated in their profiles that they have ASD (ASD users) and tweets by non-ASD users. The final subset used for training and testing contained 90,000 tweets (45,000 per group).
- **Model Selection**: Various machine learning and deep learning models were used, including Decision Trees, Extreme Gradient Boosting (XGB), K-Nearest Neighbors (KNN), Recurrent Neural Networks (RNN), Long Short-Term Memory networks (LSTM), Bidirectional Long Short-Term Memory networks (Bi-LSTM), and the pre-trained BERT and BERTweet models.

### Results

- **Model Performance**: After training and testing, the BERTweet model performed best, with an accuracy of 87.7%. The accuracies of the other models were: BERT (84.3%), XGB (71.6%), Bi-LSTM (70.3%), LSTM (69.5%), RNN (69.9%), Decision Tree (61.2%), and KNN (60.8%).
- **Confusion Matrix**: Model performance was evaluated using confusion matrices. The BERTweet model showed high true positive and true negative rates, indicating that it reliably separates ASD users from non-ASD users.

### Conclusion

The results indicate that deep learning models, particularly BERT and BERTweet, have considerable potential to improve the accuracy of ASD detection. The approach highlights the role AI can play in advancing early diagnostic techniques and improving patient outcomes, and it underscores the importance of early identification of ASD, especially in children.
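
### Illustrative Sketch: Balanced Sampling and Confusion-Matrix Evaluation

The summary describes drawing an equal number of tweets per group (45,000 each) and evaluating models with a confusion matrix. The following is a minimal sketch of those two steps in plain Python, not the authors' code. The function names (`balanced_subset`, `confusion_counts`, `summarize`), the label encoding (1 for ASD users, 0 for non-ASD users), and the toy data are all assumptions made for illustration.

```python
import random


def balanced_subset(samples, per_class, seed=0):
    """Draw `per_class` items for every label from (text, label) pairs.

    Hypothetical helper: mirrors the idea of a 45,000-per-group subset,
    but the sampling procedure used in the paper is not stated here.
    """
    rng = random.Random(seed)
    by_label = {}
    for text, label in samples:
        by_label.setdefault(label, []).append((text, label))

    subset = []
    for label, items in by_label.items():
        if len(items) < per_class:
            raise ValueError(f"label {label!r} has only {len(items)} items")
        subset.extend(rng.sample(items, per_class))  # sample without replacement
    rng.shuffle(subset)  # avoid blocks of a single class
    return subset


def confusion_counts(y_true, y_pred, positive=1):
    """Count the four cells of a binary confusion matrix."""
    counts = {"tp": 0, "fp": 0, "tn": 0, "fn": 0}
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            counts["tp" if truth == positive else "fp"] += 1
        else:
            counts["fn" if truth == positive else "tn"] += 1
    return counts


def summarize(counts):
    """Derive accuracy, true positive rate, and true negative rate."""
    tp, fp, tn, fn = counts["tp"], counts["fp"], counts["tn"], counts["fn"]
    total = tp + fp + tn + fn
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        "tpr": tp / (tp + fn) if (tp + fn) else 0.0,
        "tnr": tn / (tn + fp) if (tn + fp) else 0.0,
    }


# Toy data only: placeholder strings, not real tweets.
toy = [(f"asd tweet {i}", 1) for i in range(10)]
toy += [(f"other tweet {i}", 0) for i in range(6)]
subset = balanced_subset(toy, per_class=4)

# Hypothetical predictions from some classifier on five held-out items.
metrics = summarize(confusion_counts([1, 1, 0, 0, 1], [1, 0, 0, 1, 1]))
print(len(subset), metrics)
```

On a balanced test set such as the one described, accuracy is not skewed toward a majority class, so it is a reasonable headline metric. The true positive and true negative rates then show whether errors are concentrated in one of the two groups.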