Further varieties of ancient endogenous retrovirus in human DNA

Martin Frith
DOI: https://doi.org/10.1101/2024.12.11.627920
2024-12-13
Abstract:A retrovirus inserts its genome into the DNA of a cell, occasionally a germ-line cell that gives rise to descendants of the host organism: it is then called an endogenous retrovirus (ERV). The human genome contains relics from many kinds of ancient ERV. Some relics contributed new genes and regulatory elements. This study finds further kinds of ancient ERV, in the thoroughly-studied human genome version hg38: ERV-Hako, ERV-Saru, ERV-Hou, ERV-Han, and ERV-Goku. It also finds many relics of ERV-V, previously known from just two copies on chromosome 19 with placental genes. It finds a type of ERV flanked by MER41E long terminal repeats (LTRs), with surprisingly little similarity to the known MER41 ERV. ERV-Hako has subtypes that contain sequence from host genes SUSD6 and SPHKAP: the SUSD6 variant was transferred between catarrhine and platyrrhine primates. A retrovirus uses tRNA to prime reverse transcription: Hako is the only human ERV relic that used tRNA-Trp (tryptophan, symbol W), and HERV-W is misnamed because it used tRNA-Arg, based on the Genomic tRNA Database. One ERV-Saru LTR is the previously-described enhancer of AIM2 in innate immunity. This study contributes to understanding primate ERV history, but also shows that related ERVs can have drastic differences, challenging the goal of clearly annotating all ERV relics in genomes.
Bioinformatics
What problem does this paper attempt to address?