Diversity and evolution of the human anellome

Sejal Modha, Joseph Hughes, Richard J. Orton, Spyros Lytras
DOI: https://doi.org/10.1101/2024.05.13.593858
2024-05-15
Abstract: Anelloviruses are a group of small, circular, single-stranded DNA viruses that are found ubiquitously across mammalian hosts. Here, we explored a large number of publicly available human microbiome datasets and retrieved a total of 829 anellovirus genomes, substantially expanding the known diversity of these viruses. The majority of new genomes fall within the three major human anellovirus genera: Alphatorquevirus, Betatorquevirus and Gammatorquevirus, while we also present one new genome of the under-sampled Hetorquevirus genus. The phylogeny reconstructed from the conserved ORF1 gene reveals three additional, previously undescribed, human anellovirus clades. We performed recombination analysis and show evidence of extensive recombination across all human anelloviruses. Interestingly, more than 95% of the detected events are between members of the same clade and only 15 inter-clade recombination events were detected. The recombination breakpoints cluster in hotspots at the ends and outside of the ORF1 gene, while a recombination coldspot was detected within the gene. Our analysis suggests that anellovirus evolution is governed by homologous recombination; however, events between distant viruses or ones producing chimaeric ORF1s likely lead to non-viable recombinants. The large number of genomes further allowed us to examine how essential genomic features vary across anelloviruses. These include functional domains in the ORF1 protein and the nucleotide motif of the replication loop region, required for the viruses' rolling-circle replication. A subset of the genomes assembled in both this and previous studies completely lack these essential elements, opening up the possibility that anellovirus intracellular populations contain defective virus genomes (DVGs). Overall, our study highlights key features of the genomics and evolution of anelloviruses, a largely understudied group of viruses whose potential in virus-based therapeutics has recently begun to be explored.
Evolutionary Biology