CEG 2.0: an Updated Database of Clusters of Essential Genes Including Eukaryotic Organisms
Shuo Liu, Shu-Xuan Wang, Wei Liu, Chen Wang, Fa-Zhan Zhang, Yuan-Nong Ye, Candy-S Wu, Wen-Xin Zheng, Nini Rao, Feng-Biao Guo
DOI: https://doi.org/10.1093/database/baaa112
2020-01-01
Database
Abstract: Essential genes are the genes an organism requires to survive. Databases that store essential genes as homologous clusters, rather than as singletons, can provide more informative results, such as the general essentiality of homologous genes across multiple organisms. In 2013, CEG (Clusters of Essential Genes) was constructed as the first database to store prokaryotic essential genes in clusters. Since that release, the amount of available essential gene data has increased more than threefold. Herein, we update CEG to version 2, which includes more prokaryotic essential genes (expanded from 16 to 29 gene datasets) and newly added eukaryotic essential genes (nine species), specifically the human essential genes of 12 cancer cell lines. For prokaryotes, information associated with drug targets, such as protein structure, ligand-protein interaction, virulence factors and matched drugs, is also provided. Finally, we provide an essential gene prediction service for both prokaryotes and eukaryotes. We hope the updated database will benefit more researchers working on drug targets and evolutionary genomics.