Application of Markov Structure of Genomes to Outlier Identification and Read Classification

Alan F. Karr,Jason Hauzel,Adam A. Porter,Marcel Schaefer
DOI: https://doi.org/10.48550/arXiv.2112.13117
2021-12-25
Abstract:In this paper we apply the structure of genomes as second-order Markov processes specified by the distributions of successive triplets of bases to two bioinformatics problems: identification of outliers in genome databases and read classification in metagenomics, using real coronavirus and adenovirus data.
Genomics,Machine Learning
What problem does this paper attempt to address?