A Meta-Analysis Strategy for Gene Prioritization Using Gene Expression, SNP Genotype, and eQTL Data

Jingmin Che, Miyoung Shin
DOI: https://doi.org/10.1155/2015/576349
2015-01-01
BioMed Research International
Abstract: In order to understand disease pathogenesis, improve medical diagnosis, or discover effective drug targets, it is important to identify significant genes deeply involved in human disease. For this purpose, many earlier approaches attempted to prioritize candidate genes using gene expression profiles or SNP genotype data, but they often suffer from producing many false-positive results. To address this issue, in this paper, we propose a meta-analysis strategy for gene prioritization that employs three different genetic resources in an integrative manner: gene expression data, single nucleotide polymorphism (SNP) genotype data, and expression quantitative trait loci (eQTL) data. For integration, we utilized an improved technique for the order of preference by similarity to ideal solution (TOPSIS) to combine scores from distinct resources. This method was evaluated on two publicly available datasets regarding prostate cancer and lung cancer to identify disease-related genes. Consequently, our proposed strategy for gene prioritization showed its superiority to conventional methods in discovering significant disease-related genes with several types of genetic resources, while making good use of potential complementarities among available resources.
biotechnology & applied microbiology,medicine, research & experimental
What problem does this paper attempt to address?
The paper addresses how to identify genes significantly associated with human disease more effectively in disease genomics research. Traditional methods prioritize candidate genes using gene expression profiles or single-nucleotide polymorphism (SNP) genotype data alone, but they often produce many false-positive results, which increases the time and cost of experimental verification. To address this, the paper proposes a meta-analysis strategy that integrates three genetic resources: gene expression data, SNP genotype data, and expression quantitative trait loci (eQTL) data. Integrating these resources is intended to improve the accuracy of disease-related gene discovery. The scores from the different resources are combined with an improved Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS), which makes better use of the potential complementarity among the resources.
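To make the score-combination step more concrete, here is a minimal sketch of standard TOPSIS in Python. It is not the paper's improved variant, whose details are not given in this summary. The gene names, the three per-resource scores, the equal weights, and the function name `topsis` are all illustrative assumptions, and every score is treated as "higher is better".

```python
import math

def topsis(scores, weights=None):
    """Return a closeness value in [0, 1] for each row (standard TOPSIS).

    Assumption: every column is a benefit criterion (higher is better).
    """
    n_criteria = len(scores[0])
    if weights is None:
        # Assumption: equal weights for all resources.
        weights = [1.0 / n_criteria] * n_criteria

    # Vector-normalize each column, then apply the weights.
    norms = [math.sqrt(sum(row[j] ** 2 for row in scores)) or 1.0
             for j in range(n_criteria)]
    weighted = [[weights[j] * row[j] / norms[j] for j in range(n_criteria)]
                for row in scores]

    # Ideal and anti-ideal solutions: best and worst value per column.
    ideal = [max(row[j] for row in weighted) for j in range(n_criteria)]
    anti = [min(row[j] for row in weighted) for j in range(n_criteria)]

    closeness = []
    for row in weighted:
        d_pos = math.sqrt(sum((v - b) ** 2 for v, b in zip(row, ideal)))
        d_neg = math.sqrt(sum((v - w) ** 2 for v, w in zip(row, anti)))
        total = d_pos + d_neg
        closeness.append(d_neg / total if total > 0 else 0.0)
    return closeness

# Hypothetical scores per gene: [expression, SNP genotype, eQTL].
gene_scores = {
    "GENE_A": [0.9, 0.8, 0.7],
    "GENE_B": [0.2, 0.9, 0.1],
    "GENE_C": [0.1, 0.1, 0.2],
}

genes = list(gene_scores)
closeness = topsis([gene_scores[g] for g in genes])

# Genes closest to the ideal solution come first.
ranking = [g for _, g in sorted(zip(closeness, genes), reverse=True)]
print(ranking)
```

In this toy input, a gene that scores well on only one resource (`GENE_B`) ranks below a gene that scores well on all three (`GENE_A`). This illustrates the kind of complementarity among resources that the integration is meant to exploit.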