Abstract: Autism spectrum disorders (ASDs) in children have a tremendous impact on people's lives, and the incidence and prevalence of ASDs are increasing globally. Global health organisations and autism-treatment centres specialising in autism diagnosis and detection face challenges in providing an appropriate ASD diagnosis system that enables accurate analysis and early detection of autism. ASD detection is hampered by the unknown aetiology of the disease, and its aetiological factors urgently need to be investigated. Accordingly, providing evidence that 'sociodemographic and family characteristics' risk factors can predict ASD is a complex scientific problem that needs to be solved. This study developed an early prediction model for diagnosing and detecting children with ASD using machine learning (ML), based on effective sociodemographic and family characteristic features related to ASD. The proposed methodology involves three phases. The first, identification phase identifies a large-scale ASD dataset and applies the preprocessing stages: a 1-NN model for imputing missing data, feature selection using Chi2 and Relief, and an adaptive data-balancing approach using the Synthetic Minority Oversampling Technique (SMOTE). Chi2 and Relief are applied to determine the most effective sociodemographic and family characteristic features and to produce a new balanced ASD dataset. The second, development phase trains and tests the newly prepared ASD dataset with eight ML methods: decision tree, random forest, Naive Bayes, k-nearest neighbour (kNN), support vector machine (SVM), logistic regression, AdaBoost, and neural network multilayer perceptron (MLP). In the third phase, the developed model is evaluated using five metrics (accuracy, precision, recall, F1 and AUROC) together with test time in seconds.
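Two of the preprocessing stages named above, 1-NN imputation and SMOTE-style oversampling, can be illustrated with a minimal sketch. This is not the study's implementation: the function names `impute_1nn` and `smote_like`, the toy rows, and the parameter defaults are all assumptions made for illustration, and the Chi2/Relief feature-selection step is omitted.

```python
import math
import random

def impute_1nn(rows):
    """Fill None entries in each row with values from its nearest complete row."""
    complete = [r for r in rows if None not in r]
    filled = []
    for row in rows:
        if None not in row:
            filled.append(list(row))
            continue
        known = [i for i, v in enumerate(row) if v is not None]
        # Squared distance over the features that are observed in this row.
        nearest = min(complete,
                      key=lambda c: sum((c[i] - row[i]) ** 2 for i in known))
        filled.append([nearest[i] if v is None else v
                       for i, v in enumerate(row)])
    return filled

def smote_like(minority, n_new, k=2, seed=0):
    """Create n_new synthetic minority points by interpolating between a
    randomly chosen point and one of its k nearest minority neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        neighbours = sorted((p for p in minority if p is not base),
                            key=lambda p: math.dist(p, base))[:k]
        target = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> target
        synthetic.append([b + gap * (t - b) for b, t in zip(base, target)])
    return synthetic

# Hypothetical toy rows: [mother's age, father's age, socioeconomic score].
rows = [[30.0, 33.0, 5.0], [25.0, None, 3.0], [41.0, 45.0, 8.0]]
imputed = impute_1nn(rows)

# Hypothetical minority-class points in a two-feature space.
minority = [[1.0, 1.0], [2.0, 2.0], [3.0, 1.0]]
synthetic = smote_like(minority, n_new=4)
```

In practice a library such as imbalanced-learn would normally be used for oversampling; the hand-written version only shows the interpolation idea behind the technique.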
Results indicated the following: (1) Out of 10 highly effective sociodemographic and family characteristic features, seven related to autism cases are extracted. (2) Correlation sensitivity analysis reveals that 'Mom_age_at_child_birth' has the highest positive correlation with 'Father_age_at_child_birth', with an r-value of 0.751. Moreover, 'child_birth_month' and 'Birth_number' have the highest negative correlation with 'Ses_points_1_10', with an r-value of −0.07. (3) The AdaBoost, neural network, k-nearest neighbour, and decision tree methods show higher accuracy (0.9995, 0.9925, 0.9834, and 0.9786, respectively), whereas the random forest, logistic regression, and Naive Bayes methods show relatively lower accuracy (0.8297, 0.8199 and 0.8002, respectively). The support vector machine method shows the lowest accuracy (0.7105). AdaBoost also obtained the highest scores on the four other evaluation metrics (AUC = 0.9999, F1 = 0.9995, precision = 0.9995 and recall = 0.9995). Accordingly, the new preprocessed and balanced ASD dataset can be utilised as a data source for autism research. The preprocessing stages can be considered sound, as they yield better results than the original ASD dataset. The Chi2 and Relief feature-selection approaches produced similar results and substantially improved the classification accuracy. The study confirms the efficacy of the proposed prediction model compared with previous models on several points of comparison. Early prediction of autism is possible through this proposed model.
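The r-values reported above are correlation coefficients between pairs of features. Assuming they are Pearson coefficients (the abstract does not name the estimator), the computation can be sketched as follows. The function name `pearson_r` and the parental-age values are hypothetical and are not taken from the study's dataset.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations (unnormalised covariance).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations (unnormalised variances).
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical parental ages at the child's birth, for illustration only.
mother_age = [22, 27, 31, 35, 40]
father_age = [25, 29, 36, 34, 45]
r = pearson_r(mother_age, father_age)
```

A value of r close to 1 indicates that the two features rise together, a value close to −1 that one falls as the other rises, and a value near 0, such as the −0.07 reported above, indicates almost no linear relationship.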

An intelligent approach for autism spectrum disorder diagnosis and rehabilitation features identification

A personalized classification of behavioral severity of autism spectrum disorder using a comprehensive machine learning framework

Efficient Machine Learning Models for Early Stage Detection of Autism Spectrum Disorder

Developing an Artificial Intelligence Based Model for Autism Spectrum Disorder Detection in Children

Identification of Autism spectrum disorder based on a novel feature selection method and Variational Autoencoder

An Evaluation of Machine Learning Approaches for Early Diagnosis of Autism Spectrum Disorder

Identification of Autism Based on SVM-RFE and Stacked Sparse Auto-Encoder

Classification of Autism Spectrum Disorder Using Random Support Vector Machine Cluster

Resolving autism spectrum disorder (ASD) through brain topologies using fMRI dataset with multi-layer perceptron (MLP)

Detecting Autism Spectrum Disorder using Machine Learning

Detection of Autism Spectrum Disorder using fMRI Functional Connectivity with Feature Selection and Deep Learning

Understanding the Role of Connectivity Dynamics of Resting-State Functional MRI in the Diagnosis of Autism Spectrum Disorder: A Comprehensive Study

Recognition of autism in subcortical brain volumetric images using autoencoding-based region selection method and Siamese Convolutional Neural Network

A Three-Stage Teacher, Student Neural Networks and Sequential Feed Forward Selection-Based Feature Selection Approach for the Classification of Autism Spectrum Disorder.

ASD-DiagNet: A Hybrid Learning Approach for Detection of Autism Spectrum Disorder Using fMRI Data

Early automated prediction model for the diagnosis and detection of children with autism spectrum disorders based on effective sociodemographic and family characteristic features

Diagnosis of Autism Spectrum Disorders in Young Children Based on Resting-State Functional Magnetic Resonance Imaging Data Using Convolutional Neural Networks

Automated Detection of Autism Spectrum Disorder Using Bio-Inspired Swarm Intelligence Based Feature Selection and Classification Techniques

Early Screening of Autism Spectrum Disorder Diagnoses of Children Using Artificial Intelligence

Feature Signature Discovery for Autism Detection: An Automated Machine Learning Based Feature Ranking Framework

Topological Properties of Resting-State fMRI Functional Networks Improve Machine Learning-Based Autism Classification