Wei Chen1,2, Hong Tran2, Zhiyong Liang3, Hao Lin3, Liqing Zhang2.
Abstract
Knowledge of the distribution of N(6)-methyladenosine (m(6)A) is invaluable for understanding RNA biological functions. However, limitations in experimental methods impede progress towards the identification of m(6)A sites. As a complement to experimental methods, a support vector machine-based method is proposed to identify m(6)A sites in the Saccharomyces cerevisiae genome. In this model, RNA sequences are encoded by their nucleotide chemical properties and accumulated nucleotide frequency information. In the jackknife test, the proposed model identified m(6)A sites with an accuracy of 78.15%. For the convenience of experimental scientists, a web-server for the proposed model is provided at http://lin.uestc.edu.cn/server/m6Apred.php.
Year: 2015 PMID: 26343792 PMCID: PMC4561376 DOI: 10.1038/srep13859
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1. Sequence logo of the 10 upstream and 10 downstream nucleotides surrounding m6A sites.
The predictive results by using different features for m6A identification.
| Features | Sn (%) | Sp (%) | Acc (%) |
|---|---|---|---|
| Ring Structure | 69.27 | 63.43 | 66.34 |
| Functional Group | 70.70 | 69.90 | 70.31 |
| Hydrogen Bond | 74.18 | 68.46 | 71.32 |
| Nucleotide chemical property | 75.23 | 78.02 | 75.87 |
| Nucleotide chemical property and accumulated nucleotide frequency | 79.21 | 77.04 | 78.13 |
Performance of the proposed model at different thresholds on jackknife test.
| Threshold | Sn (%) | Sp (%) | Acc (%) |
|---|---|---|---|
| High | 38.22 | 94.95 | 66.59 |
| Medium | 55.05 | 90.02 | 72.54 |
| Low | 68.39 | 84.98 | 76.68 |
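The trade-off in the table above (a stricter threshold lowers sensitivity and raises specificity) can be illustrated with a minimal sketch. The function name `sn_sp_acc`, the toy scores, and the three cut-off values are hypothetical; the record does not state the cut-offs behind the High, Medium, and Low settings.

```python
def sn_sp_acc(scores, labels, threshold):
    """Return (Sn, Sp, Acc) in percent, calling a site positive when score >= threshold."""
    tp = fn = tn = fp = 0
    for score, label in zip(scores, labels):
        predicted_positive = score >= threshold
        if label == 1:
            if predicted_positive:
                tp += 1
            else:
                fn += 1
        else:
            if predicted_positive:
                fp += 1
            else:
                tn += 1
    sn = 100.0 * tp / (tp + fn)                    # sensitivity: true positive rate
    sp = 100.0 * tn / (tn + fp)                    # specificity: true negative rate
    acc = 100.0 * (tp + tn) / (tp + fn + tn + fp)  # overall accuracy
    return sn, sp, acc


# Toy classifier scores (hypothetical): four true m6A sites, four non-sites.
scores = [0.95, 0.80, 0.60, 0.40, 0.70, 0.30, 0.20, 0.10]
labels = [1, 1, 1, 1, 0, 0, 0, 0]

# Assumed cut-offs, chosen only to show the direction of the trade-off.
for name, threshold in [("High", 0.9), ("Medium", 0.5), ("Low", 0.25)]:
    print(name, sn_sp_acc(scores, labels, threshold))
```

On a balanced dataset such as this toy one, Acc is the mean of Sn and Sp, which matches the pattern in the table rows.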
Figure 2. A graphical illustration of the performance of the model by means of the ROC curve.
The vertical coordinate is the true positive rate (Sn), while the horizontal coordinate is the false positive rate (1-Sp). The area under the ROC curve (AUROC) is 0.84.
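For readers unfamiliar with AUROC, one way to compute it without drawing the curve is the rank-based identity: the area equals the probability that a randomly chosen positive sample scores higher than a randomly chosen negative one, with ties counted as one half. The sketch below uses that identity; the function name `auroc` and the toy data are illustrative assumptions, not the authors' code or data.

```python
def auroc(scores, labels):
    """Area under the ROC curve via pairwise comparison of positives and negatives."""
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(positives) * len(negatives))


# Toy classifier scores (hypothetical): four positives followed by four negatives.
toy_scores = [0.95, 0.80, 0.60, 0.40, 0.70, 0.30, 0.20, 0.10]
toy_labels = [1, 1, 1, 1, 0, 0, 0, 0]
print(auroc(toy_scores, toy_labels))
```

A value of 0.5 corresponds to random guessing and 1.0 to perfect separation, so the reported 0.84 sits well above chance.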
Comparison of different classifiers for m6A identification.
| Classifier | Sn (%) | Sp (%) | Acc (%) | AUROC |
|---|---|---|---|---|
| Blast | 70.75 | 67.55 | 69.11 | – |
| Naïve Bayes | 78.72 | 70.91 | 74.81 | 0.82 |
| Logistic Function | 79.32 | 74.76 | 77.04 | 0.83 |
| RBFNetwork | 61.18 | 84.49 | 72.83 | 0.79 |
| Random Forest | 78.73 | 64.78 | 71.75 | 0.78 |
| SVM | 79.21 | 77.04 | 78.15 | 0.84 |
Figure 3. A semi-screenshot of the top page of the web-server at http://lin.uestc.edu.cn/server/m6Apred.php.
Figure 4. Chemical structure of each nucleotide.
Chemical properties of nucleotides in RNA sequences.
| Chemical property | Class | Nucleotides |
|---|---|---|
| Ring Structure | Purine | A, G |
| | Pyrimidine | C, U |
| Functional Group | Amino | A, C |
| | Keto | G, U |
| Hydrogen Bond | Strong | C, G |
| | Weak | A, U |
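The table above, together with the abstract's mention of accumulated nucleotide frequency, suggests a per-position encoding that can be sketched as follows. Two points are assumptions not stated in this record: which class in each property is coded as 1 (here purine, amino, and weak hydrogen bond), and the reading of accumulated frequency as the fraction of the prefix ending at position i that matches the nucleotide at position i. The names `chemical_property` and `encode` are hypothetical.

```python
# Class membership taken from the table; the 0/1 assignment is an assumption.
PURINES = {"A", "G"}   # ring structure: purine -> 1, pyrimidine -> 0
AMINO = {"A", "C"}     # functional group: amino -> 1, keto -> 0
WEAK = {"A", "U"}      # hydrogen bond: weak -> 1, strong -> 0


def chemical_property(nucleotide):
    """Map one RNA nucleotide to a 3-bit (ring, functional group, hydrogen bond) tuple."""
    return (
        int(nucleotide in PURINES),
        int(nucleotide in AMINO),
        int(nucleotide in WEAK),
    )


def encode(sequence):
    """Encode each position as three chemical-property bits plus an accumulated frequency."""
    sequence = sequence.upper().replace("T", "U")  # accept DNA-style input
    counts = {}
    encoded = []
    for position, nucleotide in enumerate(sequence, start=1):
        if nucleotide not in "ACGU":
            raise ValueError(f"unexpected character {nucleotide!r} at position {position}")
        counts[nucleotide] = counts.get(nucleotide, 0) + 1
        # Assumed definition: occurrences of this nucleotide so far / prefix length.
        density = counts[nucleotide] / position
        encoded.append(chemical_property(nucleotide) + (density,))
    return encoded


for vector in encode("GAACU"):
    print(vector)
```

Under these assumptions each nucleotide contributes four numbers, so a window of 21 nucleotides (10 upstream, the site, 10 downstream) would yield an 84-dimensional feature vector for the classifier.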