| Literature DB >> 31681441 |
Haitao Yu1, Zhiming Dai1,2.
Abstract
DNA N6-methyladenine (6mA) is an important epigenetic modification, which is involved in many biology regulation processes. An accurate and reliable method for 6mA identification can help us gain a better insight into the regulatory mechanism of the modification. Although many experimental techniques have been proposed to identify 6mA sites genome-wide, these techniques are time consuming and laborious. Recently, several machine learning methods have been developed to identify 6mA sites genome-wide. However, there is room for the improvement on their performance for predicting 6mA sites in rice genome. In this paper, we developed a simple and lightweight deep learning model to identify DNA 6mA sites in rice genome. Our model needs no prior knowledge of 6mA or manually crafted sequence feature. We built our model based on two rice 6mA benchmark datasets. Our method got an average prediction accuracy of ∼93% and ∼92% on the two datasets we used. We compared our method with existing 6mA prediction tools. The comparison results show that our model outperforms the state-of-the-art methods.Entities:
Keywords: DNA sequence; bioinformatics; deep learning; epigenetics; rice
Year: 2019 PMID: 31681441 PMCID: PMC6797597 DOI: 10.3389/fgene.2019.01071
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Receiver operating characteristic curves of SNNRice6mA on testing sets of 6mA-rice-Chen dataset and 6mA-rice-Lv dataset. (A) Performance on the 6mA-rice-Chen dataset. (B) Performance on the 6mA-rice-Lv dataset.
Performance comparison between SNNRice6mA and several previous methods on 6mA-Rice-Chen dataset.
| Method | Sensitivity (%) | Specificity (%) | Accuracy (%) | MCC | AUC |
|---|---|---|---|---|---|
| SNNRice6mA | 92.16 | 94.32 | 93.24 | 0.87 | 0.97 |
| SNNRice6mA | 90.34 | 92.95 | 91.65 | 0.83 | 0.98 |
| i6mA-Pred | 82.95 | 83.30 | 83.13 | 0.66 | 0.89 |
| MM-6mAPred | 89.32 | 90.11 | 89.72 | 0.79 | / |
| iDNA6mA | 86.70 | 86.59 | 86.64 | 0.73 | 0.93 |
| SDM6A | 85.20 | 90.90 | 88.10 | 0.76 | 0.94 |
| iDNA6mA-rice | 83.86 | 83.41 | 83.63 | 0.67 | 0.91 |
6mA, N6-methyladenine; AUC, area under the curve; MCC, Matthews correlation coefficient.
Performance comparison between SNNRice6mA and iDNA6mA-Rice on 6mA-Rice-Lv dataset.
| Method | Sensitivity (%) | Specificity (%) | Accuracy (%) | MCC | AUC |
|---|---|---|---|---|---|
| SNNRice6mA | 93.67 | 86.74 | 90.20 | 0.81 | 0.96 |
| SNNRice6mA-large | 94.33 | 89.75 | 92.04 | 0.84 | 0.97 |
| iDNA6mA-rice | 93.00 | 90.50 | 91.70 | 0.84 | 0.96 |
6mA, N6-methyladenine; AUC, area under the curve; MCC, Matthews correlation coefficient.