Virus Classification Based on Q-vectors.

Hui Zheng, Jie Yang, Rong L. He, Stephen S-T Yau
DOI: https://doi.org/10.4310/cis.2019.v19.n1.a5
2019-01-01
Communications in Information and Systems
Abstract: Based on a Markov model, we propose a new alignment-free method, the Q-vector (QV), for sequence analysis. It incorporates the length information of viral sequences and can reflect the relationship between lower-order and higher-order k-mers. Compared with the k-mer and composition vector methods, the QV method is significantly more efficient and accurate in classifying viral genomes. By combining the distance matrices derived from the QV and the natural vector, we define a new distance matrix for classifying viral genomes, which reduces the classification errors even further. We also construct phylogenetic trees based on the new distance.
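For orientation, the k-mer baseline that the abstract compares against can be sketched as follows. This is only an illustrative sketch of a generic k-mer frequency approach, not the QV method, which the abstract does not define. The function names (`kmer_vector`, `euclidean`, `distance_matrix`), the restriction to the alphabet ACGT, the frequency normalization, and the choice of Euclidean distance are all assumptions made for this example.

```python
from itertools import product
import math


def kmer_vector(seq, k):
    """Return the normalized k-mer frequency vector of seq over ACGT.

    Windows containing any other symbol (e.g. N) are skipped (assumption).
    """
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    index = {kmer: i for i, kmer in enumerate(kmers)}
    counts = [0] * len(kmers)
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        j = index.get(seq[i:i + k])
        if j is not None:
            counts[j] += 1
    total = sum(counts)
    if total == 0:
        return [0.0] * len(kmers)
    return [c / total for c in counts]


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def distance_matrix(seqs, k):
    """Pairwise distance matrix between the k-mer vectors of seqs."""
    vecs = [kmer_vector(s, k) for s in seqs]
    n = len(vecs)
    return [[euclidean(vecs[i], vecs[j]) for j in range(n)] for i in range(n)]
```

A matrix of this kind is the usual input to a distance-based classifier or tree-building step. The vector has 4^k entries, so its size grows quickly with k.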