Tree2GD: a phylogenomic method to detect large-scale gene duplication events

Duoyuan Chen,Taikui Zhang,Yamao Chen,Hong Ma,Ji Qi
DOI: https://doi.org/10.1093/bioinformatics/btac669
IF: 5.8
2022-10-11
Bioinformatics
Abstract:Abstract Motivation Whole-genome duplication events have long been discovered throughout the evolution of eukaryotes, contributing to genome complexity and biodiversity and leaving traces in the descending organisms. Therefore, an accurate and rapid phylogenomic method is needed to identify the retained duplicated genes on various lineages across the target taxonomy. Results Here, we present Tree2GD, an integrated method to identify large-scale gene duplication events by automatically perform multiple procedures, including sequence alignment, recognition of homolog, gene tree/species tree reconciliation, Ks distribution of gene duplicates and synteny analyses. Application of Tree2GD on 2 datasets, 12 metazoan genomes and 68 angiosperms, successfully identifies all reported whole-genome duplication events exhibited by these species, showing effectiveness and efficiency of Tree2GD on phylogenomic analyses of large-scale gene duplications. Availability and implementation Tree2GD is written in Python and C++ and is available at https://github.com/Dee-chen/Tree2gd Supplementary information Supplementary data are available at Bioinformatics online.
biochemical research methods,biotechnology & applied microbiology,mathematical & computational biology
What problem does this paper attempt to address?