Combining Fuzzy Clustering and Neural Networks to Predict Protein Structural Classes

G.M. Maggiora,C.T. Zhang,K.C. Chou,D.W. Elrod
DOI: https://doi.org/10.1016/b978-012213815-7/50012-1
1996-01-01
Abstract:This chapter describes a procedure for predicting protein structural classes, such as all-α, all-β, α + β, and α/β, from amino acid composition, based upon the combined use of fuzzy c-means (FCM) clustering and multi-layer perceptrons (MLP). Unlike the case in classical, crisp clusters, objects can belong, with varying degrees between zero and one, to more than one fuzzy cluster. The prediction procedure is implemented in two phases, training and testing. In the training phase, FCM clustering is first applied to a training set of proteins, where each protein is described by a 20-component vector in amino-acid composition space. After clustering, each protein is represented by a 4-component vector whose elements provide a measure of the degree that a given protein belongs to one of the four fuzzy clusters which represent the structural classes. The 4-component vectors are used as input to an MLP, and the network is trained. In the testing phase, a new set of proteins is presented to the combined FCM-clustering/MLP system which generates a prediction in a single step. The results obtained by this hybrid neuro-fuzzy procedure are comparable in quality to the best results obtained by other workers. Moreover, addition of the MLP improves the strength of the predictions over those obtained using FCM clustering alone.
What problem does this paper attempt to address?