Abstract:Gene expression involves transcription and translation. Despite large datasets and increasingly powerful methods devoted to calculating genetic variants’ effects on transcription, discrepancy between messenger RNA and protein levels hinders the systematic interpretation of the regulatory effects of disease-associated variants. Accurate models of the sequence determinants of translation are needed to close this gap and to interpret disease-associated variants that act on translation. Here we present Translatomer, a multimodal transformer framework that predicts cell-type-specific translation from messenger RNA expression and gene sequence. We train the Translatomer on 33 tissues and cell lines, and show that the inclusion of sequence improves the prediction of ribosome profiling signal, indicating that the Translatomer captures sequence-dependent translational regulatory information. The Translatomer achieves accuracies of 0.72 to 0.80 for the de novo prediction of cell-type-specific ribosome profiling. We develop an in silico mutagenesis tool to estimate mutational effects on translation and demonstrate that variants associated with translation regulation are evolutionarily constrained, both in the human population and across species. In particular, we identify cell-type-specific translational regulatory mechanisms independent of the expression quantitative trait loci for 3,041 non-coding and synonymous variants associated with complex diseases, including Alzheimer’s disease, schizophrenia and congenital heart disease. The Translatomer accurately models the genetic underpinnings of translation, bridging the gap between messenger RNA and protein levels as well as providing valuable mechanistic insights for uninterpreted disease variants. A transformer-based approach called Translatomer is presented, which models cell-type-specific translation from messenger RNA expression and gene sequence, bridging the gap between messenger RNA and protein levels as well as providing a mechanistic insight into the genetic regulation of translation.

Synthesis of inorganic polymers as glass precursors and for other uses: Pre‐ceramic block or graft copolymers as potential precursors to composite materials

VariPred: Enhancing Pathogenicity Prediction of Missense Variants Using Protein Language Models

Enhancing missense variant pathogenicity prediction with protein language models using VariPred

Genome-wide prediction of disease variant effects with a deep protein language model

Deep Learning Prediction of Ribosome Profiling with Translatomer Reveals Translational Regulation and Interprets Disease Variants

[The most important parasitic intestinal infections in calves and their role in diarrheal diseases].

Predicting the Disease Risk of Protein Mutation Sequences With Pre-training Model

Machine Learning of Three-Dimensional Protein Structures to Predict the Functional Impacts of Genome Variation

Accurate prediction of functional effect of single amino acid variants with deep learning

Protein language model rescue mutations highlight variant effects and structure in clinically relevant genes

TransEFVP: A Two-Stage Approach for the Prediction of Human Pathogenic Variants Based on Protein Sequence Embedding Fusion

Accurate proteome-wide missense variant effect prediction with AlphaMissense

Viral transmission: Deadly contact

Fine-tuning Protein Language Models with Deep Mutational Scanning improves Variant Effect Prediction

Mvppt: A Highly Efficient and Sensitive Pathogenicity Prediction Tool for Missense Variants

REVEL: an Ensemble Method for Predicting the Pathogenicity of Rare Missense Variants

Predicted mechanistic impacts of human protein missense variants

MVP predicts the pathogenicity of missense variants by deep learning

Multi-level Protein Representation Learning for Blind Mutational Effect Prediction

Utilizing protein structure graph embeddings to predict the pathogenicity of missense variants

Enhancing Missense Variant Pathogenicity Prediction with MissenseNet: Integrating Structural Insights and ShuffleNet-Based Deep Learning Techniques