| Literature DB >> 28079126 |
Wei Chen1, Pengwei Xing2, Quan Zou2,3.
Abstract
As one of the most abundant RNA post-transcriptional modifications, N6-methyladenosine (m6A) involves in a broad spectrum of biological and physiological processes ranging from mRNA splicing and stability to cell differentiation and reprogramming. However, experimental identification of m6A sites is expensive and laborious. Therefore, it is urgent to develop computational methods for reliable prediction of m6A sites from primary RNA sequences. In the current study, a new method called RAM-ESVM was developed for detecting m6A sites from Saccharomyces cerevisiae transcriptome, which employed ensemble support vector machine classifiers and novel sequence features. The jackknife test results show that RAM-ESVM outperforms single support vector machine classifiers and other existing methods, indicating that it would be a useful computational tool for detecting m6A sites in S. cerevisiae. Furthermore, a web server named RAM-ESVM was constructed and could be freely accessible at http://server.malab.cn/RAM-ESVM/.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28079126 PMCID: PMC5227715 DOI: 10.1038/srep40242
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Comparison of different parameters for identifying m6A sites.
| Parameters | Sn (%) | Sp (%) | Acc (%) | MCC |
|---|---|---|---|---|
| 32D | 64.27 | 55.78 | 60.02 | 0.20 |
| 98D | 70.00 | 63.42 | 66.71 | 0.33 |
| motif | 66.25 | 78.56 | 72.41 | 0.45 |
| PseDNC | 71.08 | 60.21 | 65.65 | 0.31 |
Comparison of SVM with other classifiers for identifying m6A sites.
| Classifiers | Parameters | Sn (%) | Sp (%) | Acc (%) | MCC |
|---|---|---|---|---|---|
| Naïve Bayes | motif | 84.92 | 50.49 | 67.71 | 0.38 |
| PseDNC | 74.98 | 51.87 | 63.43 | 0.27 | |
| Random Forest | motif | 66.64 | 75.59 | 71.11 | 0.42 |
| PseDNC | 65.72 | 60.52 | 63.12 | 0.26 | |
| J48 | motif | 62.74 | 68.94 | 65.84 | 0.32 |
| PseDNC | 62.89 | 51.26 | 57.08 | 0.14 | |
| KNN | motif | 32.36 | 86.91 | 59.64 | 0.23 |
| PseDNC | 57.84 | 54.39 | 56.12 | 0.12 | |
| SVM | motif | 66.25 | 78.56 | 72.41 | 0.45 |
| PseDNC | 71.08 | 60.21 | 65.65 | 0.31 |
Performance of ensemble SVM and the single SVMs.
| Parameters | Sn (%) | Sp (%) | Acc (%) | MCC |
|---|---|---|---|---|
| motif | 66.25 | 78.56 | 72.41 | 0.45 |
| PseDNC | 71.08 | 60.21 | 65.65 | 0.31 |
| gksvm | 72.03 | 77.39 | 74.71 | 0.49 |
| Ensemble SVM | 78.93 | 77.78 | 78.35 | 0.57 |
Comparative results for identifying m6A sites between different methods.
| Predictor | Sn (%) | Sp (%) | Acc (%) | MCC |
|---|---|---|---|---|
| M6A-HPCS | 77.35 | 67.41 | 72.38 | 0.45 |
| RAM-ESVM | 78.93 | 77.78 | 78.35 | 0.57 |
Figure 1A semi-screenshot for the top-page of the RAM-ESVM web-server at http://server.malab.cn/RAM-ESVM/.
The original enthalpy, entropy and free energy values of the dinucleotides.
| Dinucleotide | Enthalpy | Entropy | Free energy |
|---|---|---|---|
| GG | −12.2 | −29.7 | −3.26 |
| GA | −13.3 | −35.5 | −2.35 |
| GC | −14.2 | −34.9 | −3.42 |
| GU | −10.2 | −26.2 | −2.24 |
| AG | −7.6 | −19.2 | −2.08 |
| AA | −6.6 | −18.4 | −0.93 |
| AC | −10.2 | −26.2 | −2.24 |
| AU | −5.7 | −15.5 | −1.10 |
| CG | −8.0 | −19.4 | −2.36 |
| CA | −10.5 | −27.8 | −2.11 |
| CC | −12.2 | −29.7 | −3.26 |
| CU | −7.6 | −19.2 | −2.08 |
| UG | −7.6 | −19.2 | −2.11 |
| UA | −8.1 | −22.6 | −1.33 |
| UC | −10.2 | −26.2 | −2.35 |
| UU | −6.6 | −18.4 | −0.93 |
Figure 2The workflow of RAM-ESVM.