Pro-Frame: similarity-based gene recognition in eukaryotic DNA sequences with errors

Andrey A. Mironov, Pavel S. Novichkov, Mikhail S. Gelfand
DOI: https://doi.org/10.1093/bioinformatics/17.1.13
IF: 5.8
2001-01-01
Bioinformatics
Abstract: The performance of existing algorithms for similarity-based gene recognition in eukaryotes drops when the genomic DNA has been sequenced with errors. A modification of the spliced alignment algorithm allows for gene recognition in sequences with errors, in particular frameshifts. It tolerates up to 5% sequencing errors without a considerable drop in prediction reliability when a sufficiently close homologous protein is available (normalized evolutionary distance similarity score of 50% or higher).
biochemical research methods, biotechnology & applied microbiology, mathematical & computational biology