EVE‐X: Software to Identify Novel Viral Insertions in Wild‐Caught Arthropod Hosts From Next‐Generation Short Read Data

Jessen Havill,Olivia Strasburg,Tessy Udoh,Jacob E. Crawford,Andrea Gloria‐Soria
DOI: https://doi.org/10.1111/1755-0998.14026
IF: 7.7
2024-10-10
Molecular Ecology Resources
Abstract:Eukaryotic genomes harbour sequences derived from non‐retroviral RNA viruses, known as endogenous viral elements (EVEs) or non‐retroviral integrated RNA virus sequences (NIRVS). These sequences represent a record of past infections and have been implicated in host anti‐viral response. We have created a program to identify viral sequences integrated in a host genome. It begins with a specimen BAM file and outputs candidate NIRVS, along with putative host insertion sites and overlapping genomic features of the host genome in XML and visual formats, with minimal intermediary intervention. We ran through this software short‐read data derived from the genomes of 222 wild‐caught A. aegypti mosquitoes, from a dozen geographical regions, and located putative NIRVS from seven virus families. This program is as accurate as currently available software for NIRVS detection, and represents a significant improvement in adaptability and user‐friendliness. Furthermore, the flexibility of this pipeline allows the user to search for sequence integrations across the genome of any organism, as long as a query sequence database and a reference genome is provided. Potential extended applications include identification of integrated transgenic sequences used for research or vector control strategies.
biochemistry & molecular biology,ecology,evolutionary biology
What problem does this paper attempt to address?