Abstract:BackgroundIn the United States, about 3 million people have autism spectrum disorder (ASD), and around 1 out of 59 children are diagnosed with ASD. People with ASD have characteristic social communication deficits and repetitive behaviors. The causes of this disorder remain unknown; however, in up to 25% of cases, a genetic cause can be identified. Detecting ASD as early as possible is desirable because early detection of ASD enables timely interventions in children with ASD. Identification of ASD based on objective pathogenic mutation screening is the major first step toward early intervention and effective treatment of affected children. ObjectiveRecent investigation interrogated genomics data for detecting and treating autism disorders, in addition to the conventional clinical interview as a diagnostic test. Since deep neural networks perform better than shallow machine learning models on complex and high-dimensional data, in this study, we sought to apply deep learning to genetic data obtained across thousands of simplex families at risk for ASD to identify contributory mutations and to create an advanced diagnostic classifier for autism screening. MethodsAfter preprocessing the genomics data from the Simons Simplex Collection, we extracted top ranking common variants that may be protective or pathogenic for autism based on a chi-square test. A convolutional neural network–based diagnostic classifier was then designed using the identified significant common variants to predict autism. The performance was then compared with shallow machine learning–based classifiers and randomly selected common variants. ResultsThe selected contributory common variants were significantly enriched in chromosome X while chromosome Y was also discriminatory in determining the identification of autistic individuals from nonautistic individuals. The ARSD, MAGEB16, and MXRA5 genes had the largest effect in the contributory variants. Thus, screening algorithms were adapted to include these common variants. The deep learning model yielded an area under the receiver operating characteristic curve of 0.955 and an accuracy of 88% for identifying autistic individuals from nonautistic individuals. Our classifier demonstrated a considerable improvement of ~13% in terms of classification accuracy compared to standard autism screening tools. ConclusionsCommon variants are informative for autism identification. Our findings also suggest that the deep learning process is a reliable method for distinguishing the diseased group from the control group based on the common variants of autism.

Prioritizing Autism Risk Genes Using Personalized Graphical Models Estimated From Single-Cell RNA-seq Data

Diagnostic Classification and Prognostic Prediction Using Common Genetic Variants in Autism Spectrum Disorder: Genotype-Based Deep Learning

Network assisted analysis to reveal the genetic basis of autism

Prediction and prioritization of autism-associated long non-coding RNAs using gene expression and sequence features

A scalable, high-throughput neural development platform identifies shared impact of ASD genes on cell fate and differentiation

Proximity analysis of native proteomes reveals phenotypic modifiers in a mouse model of autism and related neurodevelopmental conditions

Inherited and multiple de novo mutations in autism/developmental delay risk genes suggest a multifactorial model

Convergent Coexpression of Autism-Associated Genes Suggests Some Novel Risk Genes May Not Be Detectable in Large-Scale Genetic Studies.

Graph Node Classification to Predict Autism Risk in Genes

Cell Type-Specific Predictive Models Perform Prioritization of Genes and Gene Sets Associated With Autism

Exploiting Aberrant Mrna Expression in Autism for Gene Discovery and Diagnosis

Unraveling the immunogenetic landscape of autism spectrum disorder: a comprehensive bioinformatics approach

Dynamic convergence of autism disorder risk genes across neurodevelopment

Prioritizing Genes Associated with Brain Disorders by Leveraging Enhancer-Promoter Interactions in Diverse Neural Cells and Tissues

Integrated Analysis and Identification of Genetic Risk Alleles Related to Autism Spectrum Disorder

Autism genes converge on microtubule biology and RNA-binding proteins during excitatory neurogenesis

Logistic Regression Augmented Community Detection for Network Data with Application in Identifying Autism-Related Gene Pathways

Targeted Resequencing of 358 Candidate Genes for Autism Spectrum Disorder in a Chinese Cohort Reveals Diagnostic Potential and Genotype-Phenotype Correlations

Targeted sequencing and integrative analysis of 3,195 Chinese patients with neurodevelopmental disorders prioritized 26 novel candidate genes

Genetic Insights of Schizophrenia via Single Cell RNA-Sequencing Analyses