Investigating the Gene Expression Profiles of Cells in Seven Embryonic Stages with Machine Learning Algorithms.

Lei Chen,XiaoYong Pan,Wei Guo,Zijun Gan,Yu-Hang Zhang,Zhibin Niu,Tao Huang,Yu-Dong Cai
DOI: https://doi.org/10.1016/j.ygeno.2020.02.004
IF: 4.31
2020-01-01
Genomics
Abstract:The development of embryonic cells involves several continuous stages, and some genes are related to embryogenesis. To date, few studies have systematically investigated changes in gene expression profiles during mammalian embryogenesis. In this study, a computational analysis using machine learning algorithms was performed on the gene expression profiles of mouse embryonic cells at seven stages. First, the profiles were analyzed through a powerful Monte Carlo feature selection method for the generation of a feature list. Second, increment feature selection was applied on the list by incorporating two classification algorithms: support vector machine (SVM) and repeated incremental pruning to produce error reduction (RIPPER). Through SVM, we extracted several latent gene biomarkers, indicating the stages of embryonic cells, and constructed an optimal SVM classifier that produced a nearly perfect classification of embryonic cells. Furthermore, some interesting rules were accessed by the RIPPER algorithm, suggesting different expression patterns for different stages.
What problem does this paper attempt to address?