CeTF: an R/Bioconductor package for transcription factor co-expression networks using regulatory impact factors (RIF) and partial correlation and information (PCIT) analysis

Carlos Alberto Oliveira de Biagi,Ricardo Perecin Nociti,Danielle Barbosa Brotto,Breno Osvaldo Funicheli,Patrícia de Cássia Ruy,João Paulo Bianchi Ximenez,David Livingstone Alves Figueiredo,Wilson Araújo Silva
DOI: https://doi.org/10.1186/s12864-021-07918-2
IF: 4.547
2021-08-20
BMC Genomics
Abstract:Abstract Background Finding meaningful gene-gene interaction and the main Transcription Factors (TFs) in co-expression networks is one of the most important challenges in gene expression data mining. Results Here, we developed the R package “CeTF” that integrates the Partial Correlation with Information Theory (PCIT) and Regulatory Impact Factors (RIF) algorithms applied to gene expression data from microarray, RNA-seq, or single-cell RNA-seq platforms. This approach allows identifying the transcription factors most likely to regulate a given network in different biological systems — for example, regulation of gene pathways in tumor stromal cells and tumor cells of the same tumor. This pipeline can be easily integrated into the high-throughput analysis. To demonstrate the CeTF package application, we analyzed gastric cancer RNA-seq data obtained from TCGA (The Cancer Genome Atlas) and found the HOXB3 gene as the second most relevant TFs with a high regulatory impact (TFs-HRi) regulating gene pathways in the cell cycle. Conclusion This preliminary finding shows the potential of CeTF to list master regulators of gene networks. CeTF was designed as a user-friendly tool that provides many highly automated functions without requiring the user to perform many complicated processes. It is available on Bioconductor ( http://bioconductor.org/packages/CeTF ) and GitHub ( http://github.com/cbiagii/CeTF ).
genetics & heredity,biotechnology & applied microbiology
What problem does this paper attempt to address?