Incorporating structural characteristics for identification of protein methylation sites

(c) 2009 Wiley Periodicals, Inc.

Bibliographische Detailangaben
Veröffentlicht in:Journal of computational chemistry. - 1984. - 30(2009), 9 vom: 15. Juli, Seite 1532-43
1. Verfasser: Shien, Dray-Ming (VerfasserIn)
Weitere Verfasser: Lee, Tzong-Yi, Chang, Wen-Chi, Hsu, Justin Bo-Kai, Horng, Jorng-Tzong, Hsu, Po-Chiang, Wang, Ting-Yuan, Huang, Hsien-Da
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2009
Zugriff auf das übergeordnete Werk:Journal of computational chemistry
Schlagworte:Journal Article Research Support, Non-U.S. Gov't Proteins Glutamic Acid 3KX376GY7L Asparagine 7006-34-0 Arginine 94ZLA3W45F Lysine K3Z4F929H6
Beschreibung
Zusammenfassung:(c) 2009 Wiley Periodicals, Inc.
Studies over the last few years have identified protein methylation on histones and other proteins that are involved in the regulation of gene transcription. Several works have developed approaches to identify computationally the potential methylation sites on lysine and arginine. Studies of protein tertiary structure have demonstrated that the sites of protein methylation are preferentially in regions that are easily accessible. However, previous studies have not taken into account the solvent-accessible surface area (ASA) that surrounds the methylation sites. This work presents a method named MASA that combines the support vector machine with the sequence and structural characteristics of proteins to identify methylation sites on lysine, arginine, glutamate, and asparagine. Since most experimental methylation sites are not associated with corresponding protein tertiary structures in the Protein Data Bank, the effective solvent-accessible prediction tools have been adopted to determine the potential ASA values of amino acids in proteins. Evaluation of predictive performance by cross-validation indicates that the ASA values around the methylation sites can improve the accuracy of prediction. Additionally, an independent test reveals that the prediction accuracies for methylated lysine and arginine are 80.8 and 85.0%, respectively. Finally, the proposed method is implemented as an effective system for identifying protein methylation sites. The developed web server is freely available at http://MASA.mbc.nctu.edu.tw/
Beschreibung:Date Completed 30.07.2009
Date Revised 21.11.2013
published: Print
Citation Status MEDLINE
ISSN:1096-987X
DOI:10.1002/jcc.21232