Identifying mutation‐driven changes in gene functionality that lead to venous thromboembolism

Yanran Wang,Yana Bromberg
DOI: https://doi.org/10.1002/humu.23824
2019-09-01
Human Mutation
Abstract:Venous thromboembolism (VTE) is a common hematological disorder. VTE affects millions of people around the world each year and can be fatal. Earlier studies have revealed the possible VTE genetic risk factors in Europeans. The 2018 Critical Assessment of Genome Interpretation (CAGI) challenge had asked participants to distinguish between 66 VTE and 37 non‐VTE African American (AA) individuals based on their exome sequencing data. We used variants from AA VTE association studies and VTE genes from DisGeNET database to evaluate VTE risk via four different approaches; two of these methods were most successful at the task. Our best performing method represented each exome as a vector of predicted functional effect scores of variants within the known genes. These exome vectors were then clustered with k‐means. This approach achieved 70.8% precision and 69.7% recall in identifying VTE patients. Our second‐best ranked method had collapsed the variant effect scores into gene‐level function changes, using the same vector clustering approach for patient/control identification. These results show predictability of VTE risk in AA population and highlight the importance of variant‐driven gene functional changes in judging disease status. Of course, more in‐depth understanding of AA VTE pathogenicity is still needed for more precise predictions.This article is protected by copyright. All rights reserved.
genetics & heredity
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to identify the gene - function changes that lead to venous thromboembolism (VTE), especially for the African - American (AA) population. VTE is a common blood disease that affects millions of people around the world every year and can lead to fatal consequences. Although previous studies have revealed possible genetic risk factors for VTE in the European population, there are fewer studies on genetic risk factors for VTE in the African - American population. Therefore, this article aims to analyze the exome - sequencing data of AA individuals to distinguish VTE patients from non - VTE patients, in the hope of discovering gene variants related to VTE and their impact on gene function. Specifically, the researchers used variant data from AA VTE - association studies and VTE - related genes in the DisGeNET database to evaluate VTE risk through four different methods. Among these methods, two methods based on variant - function annotation performed best and were able to effectively distinguish VTE patients from non - VTE patients. The best method was to represent each exon as a vector of functional - effect scores of variants in known genes, and then use the k - means clustering method for classification, achieving a precision rate of 70.8% and a recall rate of 69.7%. The second - best method was to summarize the variant - effect scores into gene - level functional changes, and also use the vector - clustering method for patient/control identification. Overall, this study not only shows the possibility of predicting VTE risk in the AA population, but also emphasizes the importance of variant - driven gene - function changes in determining disease status. However, the researchers also point out that a deeper understanding of the pathological mechanisms of AA VTE is still required in order to achieve more accurate prediction.