Using support vector machine to predict beta- and gamma-turns in proteins

(c) 2008 Wiley Periodicals, Inc.

Détails bibliographiques
Publié dans:Journal of computational chemistry. - 1984. - 29(2008), 12 vom: 30. Sept., Seite 1867-75
Auteur principal: Hu, Xiuzhen (Auteur)
Autres auteurs: Li, Qianzhong
Format: Article en ligne
Langue:English
Publié: 2008
Accès à la collection:Journal of computational chemistry
Sujets:Journal Article Research Support, Non-U.S. Gov't Proteins
Description
Résumé:(c) 2008 Wiley Periodicals, Inc.
By using the composite vector with increment of diversity, position conservation scoring function, and predictive secondary structures to express the information of sequence, a support vector machine (SVM) algorithm for predicting beta- and gamma-turns in the proteins is proposed. The 426 and 320 nonhomologous protein chains described by Guruprasad and Rajkumar (Guruprasad and Rajkumar J. Biosci 2000, 25,143) are used for training and testing the predictive model of the beta- and gamma-turns, respectively. The overall prediction accuracy and the Matthews correlation coefficient in 7-fold cross-validation are 79.8% and 0.47, respectively, for the beta-turns. The overall prediction accuracy in 5-fold cross-validation is 61.0% for the gamma-turns. These results are significantly higher than the other algorithms in the prediction of beta- and gamma-turns using the same datasets. In addition, the 547 and 823 nonhomologous protein chains described by Fuchs and Alix (Fuchs and Alix Proteins: Struct Funct Bioinform 2005, 59, 828) are used for training and testing the predictive model of the beta- and gamma-turns, and better results are obtained. This algorithm may be helpful to improve the performance of protein turns' prediction. To ensure the ability of the SVM method to correctly classify beta-turn and non-beta-turn (gamma-turn and non-gamma-turn), the receiver operating characteristic threshold independent measure curves are provided
Description:Date Completed 11.09.2008
Date Revised 28.07.2008
published: Print
Citation Status MEDLINE
ISSN:1096-987X
DOI:10.1002/jcc.20929