EUPAN Enables Pan-Genome Studies of a Large Number of Eukaryotic Genomes

Zhiqiang Hu,Chen Sun,Kuang-chen Lu,Xixia Chu,Yue Zhao,Jinyuan Lu,Jianxin Shi,Chaochun Wei
DOI: https://doi.org/10.1093/bioinformatics/btx170
IF: 5.8
2017-01-01
Bioinformatics
Abstract:Pan-genome analyses are routinely carried out for bacteria to interpret the within-species gene presence/absence variations (PAVs). However, pan-genome analyses are rare for eukaryotes due to the large sizes and higher complexities of their genomes. Here we proposed EUPAN, a eukaryotic pan-genome analysis toolkit, enabling automatic large-scale eukaryotic pan-genome analyses and detection of gene PAVs at a relatively low sequencing depth. In the previous studies, we demonstrated the effectiveness and high accuracy of EUPAN in the pan-genome analysis of 453 rice genomes, in which we also revealed widespread gene PAVs among individual rice genomes. Moreover, EUPAN can be directly applied to the current re-sequencing projects primarily focusing on single nucleotide polymorphisms.
What problem does this paper attempt to address?