Abstract:Joint analysis of multiple correlated phenotypes for genome-wide association studies (GWAS) can identify and interpret pleiotropic loci which are essential to understand pleiotropy in diseases and complex traits. Meanwhile, constructing a network based on associations between phenotypes and genotypes provides a new insight to analyze multiple phenotypes, which can explore whether phenotypes and genotypes might be related to each other at a higher level of cellular and organismal organization. In this paper, we first develop a bipartite signed network by linking phenotypes and genotypes into a Genotype and Phenotype Network (GPN). The GPN can be constructed by a mixture of quantitative and qualitative phenotypes and is applicable to binary phenotypes with extremely unbalanced case-control ratios in large-scale biobank datasets. We then apply a powerful community detection method to partition phenotypes into disjoint network modules based on GPN. Finally, we jointly test the association between multiple phenotypes in a network module and a single nucleotide polymorphism (SNP). Simulations and analyses of 72 complex traits in the UK Biobank show that multiple phenotype association tests based on network modules detected by GPN are much more powerful than those without considering network modules. The newly proposed GPN provides a new insight to investigate the genetic architecture among different types of phenotypes. Multiple phenotypes association studies based on GPN are improved by incorporating the genetic information into the phenotype clustering. Notably, it might broaden the understanding of genetic architecture that exists between diagnoses, genes, and pleiotropy. Biological pleiotropy refers to a SNP or gene that has a direct biological influence on more than one phenotypic trait, which can offer significant insights in understanding the complex genotype-phenotype relationships. Network analyses provide an integrative approach to characterize complex genomic associations by linking phenotypes and genotypes into a Genotype and Phenotype Network (GPN). Jointly analyzing multiple phenotypes and incorporating the genetic information into the phenotype clustering may increase the statistical power to discover the cross-phenotype association and pleiotropy. We evaluate our proposed multiple phenotype association tests based on network modules detected by GPN for 72 EHR-derived phenotypes in the diseases of the musculoskeletal system and connective tissue in the UK Biobank. From the post-GWAS analyses, we observe that the test based on GPN can identify more significantly enriched biological pathways than that without considering the network modules. Meanwhile, some of the uniquely identified SNPs by the test based on GPN are also colocalized in the eQTL study of the gene expression in the Muscle Skeletal tissue.

Network regression analysis for binary and ordinal categorical phenotypes in transcriptome-wide association studies.

Towards Identification of Human Disease Phenotype-Genotype Association via a Network-Module Based Method

Network Reconstruction for Trans Acting Genetic Loci Using Multi-Omics Data and Prior Information

Network Regression Analysis in Transcriptome-Wide Association Studies

Ordered multinomial regression for genetic association analysis of ordinal phenotypes at Biobank scale

CoNet: Efficient Network Regression for Survival Analysis in Transcriptome-Wide Association Studies-With Applications to Studies of Breast Cancer.

Identification of Phenotypic Networks Based on Whole Transcriptome by Comparative Network Decomposition

Novel Association Strategy with Copy Number Variation for Identifying New Risk Loci of Human Diseases

A novel method for multiple phenotype association studies based on genotype and phenotype network

Integrative Analysis of Genetical Genomics Data Incorporating Network Structures.

Interpreting Functional Impact of Genetic Variations by Network QTL for Genotype–Phenotype Association Study

Gene-based principal component logistic regression model and its application on genome-wide association study]

Detection of genes for ordinal traits in nuclear families and a unified approach for association studies.

Nonlinear network-based quantitative trait prediction from transcriptomic data

Development of network‐guided transcriptomic risk score for disease prediction

Network or regression-based methods for disease discrimination: a comparison study

A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data

NETWORK-REGULARIZED HIGH-DIMENSIONAL COX REGRESSION FOR ANALYSIS OF GENOMIC DATA.

Logistic Regression Augmented Community Detection for Network Data with Application in Identifying Autism-Related Gene Pathways

A New System Identification Approach to Identify Genetic Variants in Sequencing Studies for a Binary Phenotype

Linkage Analysis of Ordinal Traits for Pedigree Data