Gaussian Kernel Based Gene Selection in a Single Cell Gene Decision Space

Zhaowen Li,Junhong Feng,Jie Zhang,Fang Liu,Pei Wang,Ching-Feng Wen
DOI: https://doi.org/10.1016/j.ins.2022.08.050
IF: 8.1
2022-01-01
Information Sciences
Abstract:Information system is a database that shows relationships between objects and attributes. A real-valued information system is an information system whose information values are real numbers. A real-valued information system with decision attributes is referred to as a real-valued decision information system. If objects, conditional attributes and information values in a real-valued decision information system are cells, genes and gene expression values, respectively, then this information system is said to be a gene decision space. In a gene decision space, people are faced with gene expression data. If gene expression data in a gene decision space changes to single cell RNA-seq data, then this space is referred to as a single cell gene decision space. This paper explores gene selection in a single cell gene decision space in terms of Gaussian kernel. In the first place, the distance between two cells in each subspace of a single cell gene decision space is constructed. Next, the fuzzy Tcos-equivalence relation on the cell set is obtained in terms of Gaussian kernel. After that, measures of uncertainty for a single cell gene decision space are investigated. Lastly, gene selection algorithms in a single cell gene decision space are presented in terms of the proposed information entropy and information granularity. The presented algorithms are testified in several publicly open single cell RNA-seq data sets. Experimental results reveal that the presented algorithms can select appropriate genes related to classification, and significantly improve classification performances.
What problem does this paper attempt to address?