A New Strategy to Evaluate Protein Motifs

DU Chunjuan, ZHU Yunping, HE Fuchu, Zeng Yanjun
DOI: https://doi.org/10.3969/j.issn.1002-3208.2005.02.005
2005-01-01
Abstract: A motif is an important concept for describing the common structure and function shared by the members of a protein family. However, properly identifying and evaluating the motifs derived from various bioinformatics methods remains difficult. This paper introduces a new strategy for evaluating motifs, based on the notion of a classifier. It compares the motifs constructed by various methods for a single protein family and identifies the best motif as the one that is most biologically significant. Seven cytokine families in the PROSITE protein database are processed with both MEME and HMMER to generate their respective motifs. Each motif is then treated as a classifier, and its sensitivity and specificity are computed for the same cytokine family; the receiver operating characteristic (ROC) curves derived from the motifs are then compared. These indices and comparisons are used to select the best motif model for each cytokine family, while true and false motifs are discriminated quantitatively. The strategy may be extended to evaluate any motif for any protein family, and the best motif may be used to predict novel members of a given protein family by database searching.
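The classifier view described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the thresholding rule (a sequence is called a family member when its motif score is at or above the threshold), and the trapezoidal area under the ROC curve used as a single summary number are all assumptions made for illustration. Scores are assumed to come from whatever search tool produced the motif match, with a higher score meaning a better match.

```python
from typing import List, Sequence, Tuple


def sensitivity_specificity(
    scores: Sequence[float], labels: Sequence[bool], threshold: float
) -> Tuple[float, float]:
    """Treat the motif as a classifier: score >= threshold means 'family member'.

    labels[i] is True when sequence i is a known member of the family.
    """
    tp = fn = tn = fp = 0
    for score, is_member in zip(scores, labels):
        predicted = score >= threshold
        if is_member and predicted:
            tp += 1
        elif is_member:
            fn += 1
        elif predicted:
            fp += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return sensitivity, specificity


def roc_points(
    scores: Sequence[float], labels: Sequence[bool]
) -> List[Tuple[float, float]]:
    """Return (1 - specificity, sensitivity) pairs, sweeping the threshold
    from the highest observed score down to the lowest."""
    points = [(0.0, 0.0)]
    for threshold in sorted(set(scores), reverse=True):
        sens, spec = sensitivity_specificity(scores, labels, threshold)
        points.append((1.0 - spec, sens))
    return points


def auc(points: Sequence[Tuple[float, float]]) -> float:
    """Area under the ROC curve by the trapezoidal rule (an assumed summary)."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area


def best_motif(candidates: dict, labels: Sequence[bool]) -> str:
    """Pick the candidate motif whose scores give the largest ROC area."""
    return max(candidates, key=lambda name: auc(roc_points(candidates[name], labels)))
```

Given made-up scores for two hypothetical motifs over the same labelled sequences, for example `best_motif({"motif_a": [...], "motif_b": [...]}, labels)`, the function returns the name of the motif that separates family members from non-members more cleanly. Comparing whole ROC curves, as the abstract describes, is more informative than a single area value when curves cross.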