A Multiple-Instance Scoring Method to Predict Tissue-Specific Cis-Regulatory Motifs and Regions

Jin Gu
DOI: https://doi.org/10.1038/npre.2009.4038.1
2009-01-01
Nature Precedings
Abstract:AbstractTranscription is the central process of gene regulation. In higher eukaryotes, the transcription of a gene is usually regulated by multiple cis-regulatory regions (CRRs). In different tissues, different transcription factors bind to their cis-regulatory motifs in these CRRs to drive tissue-specific expression patterns of their target genes. By combining the genome-wide gene expression data with the genomic sequence data, we proposed multiple-instance scoring (MIS) method to predict the tissue-specific motifs and the corresponding CRRs. The method is mainly based on the assumption that only a subset of CRRs of the expressed gene should function in the studied tissue. By testing on the simulated datasets and the fly muscle dataset, MIS can identify true motifs when noise is high and shows higher specificity for predicting the tissue-specific functions of CRRs.
What problem does this paper attempt to address?