Model performance and interpretability of semi-supervised generative adversarial networks to predict oncogenic variants with unlabeled data

Zilin Ren, Quan Li, Kajia Cao, Marilyn M. Li, Yunyun Zhou, Kai Wang
DOI: https://doi.org/10.1186/s12859-023-05141-2
IF: 3.307
2023-02-11
BMC Bioinformatics
Abstract: Predicting the functional consequences or clinical impacts of genetic variants in human diseases, such as cancer, remains an important challenge. An increasing number of genetic variants in cancer have been discovered and documented in public databases such as COSMIC, but the vast majority of them have no functional or clinical annotations. Some databases, such as CIViC, provide manual annotations of functional mutations, but they are small because the annotation relies on human curation. Since unlabeled data (millions of variants) typically outnumber labeled data (thousands of variants), computational tools that take advantage of unlabeled data may improve prediction accuracy.
biochemical research methods, biotechnology & applied microbiology, mathematical & computational biology