| Literature DB >> 32397453 |
Omid Mahmoudi1, Abdul Wahab1, Kil To Chong2.
Abstract
One of the most common and well studied post-transcription modifications in RNAs is N6-methyladenosine (m6A) which has been involved with a wide range of biological processes. Over the past decades, N6-methyladenosine produced some positive consequences through the high-throughput laboratory techniques but still, these lab processes are time consuming and costly. Diverse computational methods have been proposed to identify m6A sites accurately. In this paper, we proposed a computational model named iMethyl-deep to identify m6A Saccharomyces Cerevisiae on two benchmark datasets M6A2614 and M6A6540 by using single nucleotide resolution to convert RNA sequence into a high quality feature representation. The iMethyl-deep obtained 89.19% and 87.44% of accuracy on M6A2614 and M6A6540 respectively which show that our proposed method outperforms the state-of-the-art predictors, at least 8.44%, 8.96%, 8.69% and 0.173 on M6A2614 and 15.47%, 28.52%, 25.54 and 0.5 on M6A6540 higher in terms of four metrics Sp, Sn, ACC and MCC respectively. Meanwhile, M6A6540 dataset never used to train a model.Entities:
Keywords: RNA N6-methyladenosine site; bioinformatics; computational biology; deep learning; methylation; yeast genome
Mesh:
Substances:
Year: 2020 PMID: 32397453 PMCID: PMC7288457 DOI: 10.3390/genes11050529
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Benchmark datasets demonstration.
| Datasets | Positive | Negative | Total |
|---|---|---|---|
| M6A2614 | 1307 | 1307 | 2614 |
| M6A6540 | 3270 | 3270 | 6540 |
Range of Hyper-parameters.
| Parameters | Range |
|---|---|
| Convolution layers | [1, 2, 3, 4] |
| Filters in convolution Layer | [6, 8, 16, 24, 32, 44, 64] |
| Filter size | [2, 4, 5, 7, 8, 10, 13] |
| Pool-size in Maxpooling | [2, 4] |
| Stride length in Maxpooling | [2, 4] |
| Dropout values | [0.3, 0.35, 0.4, 0.45, 0.5] |
The architecture of the proposed model.
| Layer | Output Shape |
|---|---|
| Input | (51, 4) |
| Conv1D(16, 5, 1) | (47, 16) |
| ELU | (47, 16) |
| GroupNormalization(4) | (47, 16) |
| MaxPool1D (4, 2) | (22, 16) |
| Conv1D(16, 5, 1) | (18,16) |
| ELU | (18, 16) |
| GroupNormalization(4) | (18, 16) |
| MaxPool1D(4,2) | (8, 16) |
| Flatten | (128) |
| Dropout(0.35) | (128) |
| Dense(32) | (32) |
| Dense(1) | 1 |
| Sigmoid | 1 |
Figure 1A graphical illustration of iMethyl-deep. Inputted RNA sequence converted into one-hot encoded, then fed into the Convolution Neural Network (CNN) layers for training the datasets.
Performance comparison of iMethyl-deep with other four state-of-the-art methods on M6A2614 dataset. Overall accuracy (ACC), Mathew’s correlation coefficient (MCC), specificity (Sp), and sensitivity (Sn).
| Model | Sp (%) | Sn (%) | ACC (%) | MCC |
|---|---|---|---|---|
| iRNA-Methyl | 60.63 | 70.55 | 65.59 | 0.29 |
| RAM-ESVM | 77.78 | 78.93 | 78.35 | 0.57 |
| RAM-NPPS | 80.87 | 78.42 | 79.65 | 0.59 |
| DeepM6APred | 81.48 | 79.50 | 80.50 | 0.61 |
| iMethyl-deep | 89.92 | 88.46 | 89.19 | 0.78 |
Figure 2Performance evaluation illustration of iMethyl-deep on M6A2146 dataset.
Figure 3The receiver operating characteristics (ROC) curve of iMethyl-deep on M6A2614 dataset.
Figure 4Graphical illustration of confusion matrix of iMethyl-deep on M6A2614 dataset.
The results of iMethyl-deep on benckmark M6A6540 dataset.
| Model | Sp (%) | Sn (%) | ACC (%) | MCC |
|---|---|---|---|---|
| RAM-NPPS | 71.07 | 34.59 | 52.83 | 0.06 |
| iRNA-Methyl | 61.68 | 59.82 | 60.75 | 0.22 |
| RAM-ESVM | 64.53 | 59.27 | 61.90 | 0.24 |
| iMethyl-deep | 86.54 | 88.34 | 87.44 | 0.74 |
Figure 5Performance evaluation illustration of iMethyl-deep on M6A6540 dataset.
Figure 6The receiver operating characteristics (ROC) curve of iMethyl-deep on M6A6540 dataset.
Figure 7Graphical illustration of confusion matrix of iMethyl-deep on M6A6540 dataset.