TM-search: an Efficient and Effective Tool for Protein Structure Database Search.

Zi Liu,Chengxin Zhang,Qidi Zhang,Yang Zhang,Dong-Jun Yu
DOI: https://doi.org/10.1021/acs.jcim.3c01455
IF: 6.162
2024-01-01
Journal of Chemical Information and Modeling
Abstract:The quickly increasing size of the Protein Data Bank is challenging biologists to develop a more scalable protein structure alignment tool for fast structure database search. Although many protein structure search algorithms and programs have been designed and implemented for this purpose, most require a large amount of computational time. We propose a novel protein structure search approach, TM-search, which is based on the pairwise structure alignment program TM-align and a new iterative clustering algorithm. Benchmark tests demonstrate that TM-search is 27 times faster than a TM-align full database search while still being able to identify similar to 90% of all high TM-score hits, which is 2-10 times more than other existing programs such as Foldseek, Dali, and PSI-BLAST.
What problem does this paper attempt to address?