A Multiple Criteria Framework for 3D Protein Structure Similarity Retrieval

Min HU,Qun-Sheng PENG,Li-Guang XIE,Tao ZHANG,Wei CHEN
DOI: https://doi.org/10.3321/j.issn:0254-4164.2006.12.019
2006-01-01
Jisuanji Xuebao/Chinese Journal of Computers
Abstract:The intrinsic relationship between the function of a protein and its structure is an important issue in the study of contemporary life science. Although the similarity comparisons of protein structures can provide some hints in such study, efficient retrieval of proteins based on 3D structure similarity is still a hard task due to the continually increasing large protein datasets. To overcome this difficulty, a multiple criteria framework (MCF) is proposed to reduce the computation cost. Three kinds of features, which are invariant against translation and rotation, are adopted as the criteria successively during the retrieval process under MCF, including the spatial walking of protein’s backbone, distance histogram and the radial distribution of the distance matrix. While the protein retrieval based on each of the above features involves only simple calculation, the intersection of their retrieval results reduce the candidate set dramatically and rapidly. Experiments using query-by-example on a representative database, including 27804 samples, demonstrate that the techniques can cut down the pruning time cost of traditional methods effectively while retaining the sensitivity. The approach is highly complementary to rapid protein structure similarity retrieval.
What problem does this paper attempt to address?