ESCCdb: A Comprehensive Database and Key Regulator Exploring Platform Based on Cross Dataset Comparisons for Esophageal Squamous Cell Carcinoma

Jian Yang,Liyun Bi,Chen Wang,Gang Wang,Yixiong Gou,Liting Dong,Maoxu Wang,Hong Luo,Kun Wang,Yu Wang,Yue Huang,Haoyang Cai,Zhixiong Xiao
DOI: https://doi.org/10.1016/j.csbj.2023.03.026
IF: 6.155
2023-01-01
Computational and Structural Biotechnology Journal
Abstract:Esophageal cancer is the seventh most prevalent and the sixth most lethal cancer. Esophageal squamous cell carcinoma (ESCC) is one of the major esophageal cancer subtypes that accounts for 87 % of the total cases. However, its molecular mechanism remains unclear. Here, we present an integrated database for ESCC called ESCCdb, which includes a total of 56 datasets and published studies from the GEO, Xena or SRA databases and related publications. It helps users to explore a particular gene with multiple graphical and interactive views with one click. The results comprise expression changes across 20 datasets, copy number alterations in 11 datasets, somatic mutations from 12 papers, related drugs derived from DGIdb, related pathways, and gene correlations. ESCCdb enables directly cross-dataset comparison of a gene’s mutations, expressions and copy number changes in multiple datasets. This allows users to easily assess the alterations in ESCC. Furthermore, survival analysis, drug-gene relationships, and results from whole-genome CRISPR/Cas9 screening can help users determine the clinical relevance, derive functional inferences, and identify potential drugs. Notably, ESCCdb also enables the exploration of the correlation structure and identification of potential key regulators for a process. Finally, we identified 789 consistently differential expressed genes; we summarized recurrently mutated genes and genes affected by significant copy number alterations. These genes may be stable biomarkers or important players during ESCC development. ESCCdb fills the gap between massive omics data and users’ needs for integrated analysis and can promote basic and clinical ESCC research. The database is freely accessible at http://cailab.labshare.cn/ESCCdb.
What problem does this paper attempt to address?