A Machine Learning Approach to Predict Gene Regulatory Networks in Seed Development in Arabidopsis

Ying Ni,Delasa Aghamirzaie,Haitham Elmarakeby,Eva Collakova,Song Li,Ruth Grene,Lenwood S. Heath
DOI: https://doi.org/10.3389/fpls.2016.01936
IF: 5.6
2016-12-23
Frontiers in Plant Science
Abstract:Gene regulatory networks (GRNs) provide a representation of relationships between regulators and their target genes. Several methods for GRN inference, both unsupervised and supervised, have been developed to date. Because regulatory relationships consistently reprogram in diverse tissues or under different conditions, GRNs inferred without specific biological contexts are of limited applicability. In this report, a machine learning approach is presented to predict GRNs specific to developing <i>Arabidopsis thaliana</i> embryos. We developed the Beacon GRN inference tool to predict GRNs occurring during seed development in Arabidopsis based on a support vector machine (SVM) model. We developed both global and local inference models and compared their performance, demonstrating that local models are generally superior for our application. Using both the expression levels of the genes expressed in developing embryos and prior known regulatory relationships, GRNs were predicted for specific embryonic developmental stages. The targets that are strongly positively correlated with their regulators are mostly expressed at the beginning of seed development. Potential direct targets were identified based on a match between the promoter regions of these inferred targets and the <i>cis</i> elements recognized by specific regulators. Our analysis also provides evidence for previously unknown inhibitory effects of three positive regulators of gene expression. The Beacon GRN inference tool provides a valuable model system for context-specific GRN inference and is freely available at https://github.com/BeaconProjectAtVirginiaTech/beacon_network_inference.git.
plant sciences
What problem does this paper attempt to address?