Feature-Based Causal Structure Discovery in Protein and Gene Expression Data with Bayesian Network

Jingwei Liu,Minghua Deng,Minping Qian
DOI: https://doi.org/10.1109/icnc.2009.667
2009-01-01
Abstract:Causal structure discovery is an important problem in protein sequences and gene--gene interaction in gene expression data, which will reveal the elementary structure of the protein sequence and the gene--gene interaction by the expression level of genes within the cell. In this paper, we investigate the feature--based causal structure learning methods for protein sequence andgene expression data respectively.Three feature extraction methods are proposed to model casual structurewith Bayesian network with Dirichlet distribution in protein sequence data, and a factor analysis based feature extraction method is discussed for gene expression data Bayesian network learning.The Truncated hemoglobinsuperfamily from SCOP protein database and Princeton colon gene expression data are involved to demonstrate the causal structure of Bayesian network determined by different feature extraction.
What problem does this paper attempt to address?