2D-RNA-coupling numbers : a new computational chemistry approach to link secondary structure topology with biological function
Methods for prediction of proteins, DNA, or RNA function and mapping it onto sequence often rely on bioinformatics alignment approach instead of chemical structure. Consequently, it is interesting to develop computational chemistry approaches based on molecular descriptors. In this sense, many resea...
Veröffentlicht in: | Journal of computational chemistry. - 1984. - 28(2007), 6 vom: 30. Apr., Seite 1049-56 |
---|---|
1. Verfasser: | |
Weitere Verfasser: | , , , , , , |
Format: | Aufsatz |
Sprache: | English |
Veröffentlicht: |
2007
|
Zugriff auf das übergeordnete Werk: | Journal of computational chemistry |
Schlagworte: | Journal Article Research Support, Non-U.S. Gov't Plant Proteins RNA 63231-63-0 Amino Acid Oxidoreductases EC 1.4.- 1-aminocyclopropane-1-carboxylic acid oxidase EC 1.4.3.- |
Zusammenfassung: | Methods for prediction of proteins, DNA, or RNA function and mapping it onto sequence often rely on bioinformatics alignment approach instead of chemical structure. Consequently, it is interesting to develop computational chemistry approaches based on molecular descriptors. In this sense, many researchers used sequence-coupling numbers and our group extended them to 2D proteins representations. However, no coupling numbers have been reported for 2D-RNA topology graphs, which are highly branched and contain useful information. Here, we use a computational chemistry scheme: (a) transforming sequences into RNA secondary structures, (b) defining and calculating new 2D-RNA-coupling numbers, (c) seek a structure-function model, and (d) map biological function onto the folded RNA. We studied as example 1-aminocyclopropane-1-carboxylic acid (ACC) oxidases known as ACO, which control fruit ripening having importance for biotechnology industry. First, we calculated tau(k)(2D-RNA) values to a set of 90-folded RNAs, including 28 transcripts of ACO and control sequences. Afterwards, we compared the classification performance of 10 different classifiers implemented in the software WEKA. In particular, the logistic equation ACO = 23.8 . tau(1)(2D-RNA) + 41.4 predicts ACOs with 98.9%, 98.0%, and 97.8% of accuracy in training, leave-one-out and 10-fold cross-validation, respectively. Afterwards, with this equation we predict ACO function to a sequence isolated in this work from Coffea arabica (GenBank accession DQ218452). The tau(1)(2D-RNA) also favorably compare with other descriptors. This equation allows us to map the codification of ACO activity on different mRNA topology features. The present computational-chemistry approach is general and could be extended to connect RNA secondary structure topology to other functions |
---|---|
Beschreibung: | Date Completed 12.06.2007 Date Revised 21.11.2008 published: Print Citation Status MEDLINE |
ISSN: | 1096-987X |