Comparative analysis of microsatellites and compound microsatellites in T4-like viruses

lan zhou,liang deng,yongzhuo fu,xiaolong wu,xiangyan zhao,yubao chen,mingfu li,zhongyang tan
DOI: https://doi.org/10.1016/j.gene.2015.09.053
IF: 3.913
2016-01-01
Gene
Abstract:Microsatellites or simple sequence repeats (SSRs) are known to present ubiquitously in genomes of eukaryotes and prokaryotes, as well as viruses. A comprehensive analysis of microsatellites and compound microsatellites (CM) was performed for 67 T4-like bacteriophage genomes. We found that the number of repeats was generally proportional to the size of the genome. CM were more abundant in genic regions, while their relative abundance was higher in intergenic regions. Meanwhile, the number of CM rapidly decreased with the increase of complexity but gradually increased with higher dMAX (maximum distance between any two adjacent microsatellites). (A)n/(T)n, (AT)n/(TA)n and (AAG)n were the most abundant repeats of mono-, di- and trinucleotide microsatellites, respectively. The number of microsatellites in reference sequences was significantly lower than that in corresponding random sequences. This result was mainly attributed to mono- and dinucleotide repeats which hardly exceeded 6bp in T4-like viruses. These observations may be helpful to understand the distribution of microsatellites and viral genetic diversity in T4-like viruses.
What problem does this paper attempt to address?