Comparison of gene clustering criteria reveals intrinsic uncertainty in pangenome analyses

Saioa Manzano-Morales,Yang Liu,Sara González-Bodí,Jaime Huerta-Cepas,Jaime Iranzo
DOI: https://doi.org/10.1186/s13059-023-03089-3
IF: 17.906
2023-10-31
Genome Biology
Abstract:A key step for comparative genomics is to group open reading frames into functionally and evolutionarily meaningful gene clusters. Gene clustering is complicated by intraspecific duplications and horizontal gene transfers that are frequent in prokaryotes. In consequence, gene clustering methods must deal with a trade-off between identifying vertically transmitted representatives of multicopy gene families, which are recognizable by synteny conservation, and retrieving complete sets of species-level orthologs. We studied the implications of adopting homology, orthology, or synteny conservation as formal criteria for gene clustering by performing comparative analyses of 125 prokaryotic pangenomes.
genetics & heredity,biotechnology & applied microbiology
What problem does this paper attempt to address?