RepeatModeler2 for automated genomic discovery of transposable element families

Jullien M Flynn,Robert Hubley,Clément Goubert,Jeb Rosen,Andrew G Clark,Cédric Feschotte,Arian F Smit,Jullien M. Flynn,Andrew G. Clark,Arian F. Smit
DOI: https://doi.org/10.1073/pnas.1921046117
IF: 11.1
2020-04-16
Proceedings of the National Academy of Sciences
Abstract:Significance Genome sequences are being produced for more and more eukaryotic species. The bulk of these genomes are composed of parasitic, self-mobilizing transposable elements (TEs) that play important roles in organismal evolution. Thus there is a pressing need for developing software that can accurately identify the diverse set of TEs dispersed in genome sequences. Here we introduce RepeatModeler2, an easy-to-use package for the production of reference TE libraries which can be applied to any eukaryotic species. Through several major improvements over the previous version, RepeatModeler2 is able to produce libraries that recapitulate the known composition of three model species with some of the most complex TE landscapes. Thus RepeatModeler2 will greatly enhance the discovery and annotation of TEs in genome sequences.
multidisciplinary sciences
What problem does this paper attempt to address?