Protein Sequence Comparison Based on Physicochemical Properties and the Position-Feature Energy Matrix

Lulu Yu, Yusen Zhang, Ivan Gutman, Yongtang Shi, Matthias Dehmer
DOI: https://doi.org/10.1038/srep46787
IF: 4.6
2017-01-01
Scientific Reports
Abstract: We develop a novel position-feature-based model for protein sequences by employing physicochemical properties of the 20 amino acids and the measure of graph energy. The method puts the emphasis on sequence order information and describes local dynamic distributions of sequences, from which one can get a characteristic B-vector. Afterwards, we apply the relative entropy to the sequences representing B-vectors to measure their similarity/dissimilarity. The numerical results obtained in this study show that the proposed method leads to meaningful results compared with competitors such as Clustal W.
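The comparison step mentioned in the abstract, relative entropy between feature vectors, can be illustrated with a minimal sketch. It assumes each B-vector is a non-negative list of numbers that can be normalized to sum to 1. The abstract does not say how B-vectors are built or whether the measure is symmetrized, so the function names, the `eps` smoothing term, the symmetrized sum, and the sample vectors below are all assumptions made for illustration, not the authors' implementation.

```python
import math


def normalize(vec):
    """Scale a non-negative vector so its entries sum to 1."""
    if any(v < 0 for v in vec):
        raise ValueError("entries must be non-negative")
    total = sum(vec)
    if total <= 0:
        raise ValueError("vector must have a positive sum")
    return [v / total for v in vec]


def relative_entropy(p, q, eps=1e-12):
    """Relative entropy (Kullback-Leibler divergence) of p with respect to q.

    eps is an assumed smoothing term that avoids division by zero
    when an entry of q is 0; it is not taken from the paper.
    """
    if len(p) != len(q):
        raise ValueError("vectors must have the same length")
    p, q = normalize(p), normalize(q)
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )


def symmetric_distance(p, q):
    """Assumed symmetrization: sum of the two directed relative entropies."""
    return relative_entropy(p, q) + relative_entropy(q, p)


# Hypothetical feature vectors standing in for two sequences' B-vectors.
b_vector_1 = [0.2, 0.3, 0.5]
b_vector_2 = [0.5, 0.3, 0.2]
print(symmetric_distance(b_vector_1, b_vector_2))
```

A value near zero indicates nearly identical vectors, and larger values indicate greater dissimilarity. A matrix of such pairwise values could then feed a standard clustering or tree-building step.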