| Literature DB >> 32211035 |
Feng Zeng1, Guanyun Fang1, Lan Yao2.
Abstract
Motivation: N4-methylcytosine (4mC) plays an important role in host defense and transcriptional regulation. Accurate identification of 4mc sites provides a more comprehensive understanding of its biological effects. At present, the traditional machine learning algorithms are used in the research on 4mC sites prediction, but the complexity of the algorithms is relatively high, which is not suitable for the processing of large data sets, and the accuracy of prediction needs to be improved. Therefore, it is necessary to develop a new and effective method to accurately identify 4mC sites.Entities:
Keywords: BLSTM; CNN; N4-methylcytosine; deep neural network; integrated algorithm; machine learning
Year: 2020 PMID: 32211035 PMCID: PMC7067889 DOI: 10.3389/fgene.2020.00209
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1A graphical illustration of the 4mcDeep-CBI model.
Figure 2Evaluate the performance of preliminary feature and advanced feature on the same data set.
Figure 3Acc-loss curve of AD_BKF based on 3-CNN and BLSTM models. Where AD_BKF is a advanced feature of BKF.
Figure 4Experimental result graph after using integrated algorithm.
Figure 5Performance evaluation of our predictor and the state-of-the-art predictor on the same dataset.
Figure 6ROC curves of our predictor and the state-of-the-art predictor on the same dataset.
Running time of the main modules of 4mcPred-IFL and 4mcDeep-CBI.
| 1,000 | 31.3 | 9.2 | 3.1 |
| 4,000 | 1034.4 | 222.1 | 10.7 |
| 7,000 | 3123.6 | 698.4 | 19.8 |
| 10,000 | 6255.8 | 1365.6 | 24.5 |
| 13,000 | 9449.5 | 2173.2 | 35.1 |
| 16,000 | 15094.4 | 3261.3 | 48.2 |
ACC of 4mcDeep-CBI with 4 CNN layers under different parameters.
| 4, 8, 16, 32 | 4, 4, 4, 4 | 90.02 |
| 4, 8, 16, 32 | 8, 8, 8, 8 | 89.46 |
| 4, 8, 16, 32 | 16, 16, 16, 16 | 88.70 |
| 8, 16, 32, 64 | 4, 4, 4, 4 | 90.17 |
| 8, 16, 32, 64 | 8, 8, 8, 8 | 90.02 |
| 8, 16, 32, 64 | 16, 16, 16, 16 | 89.25 |
| 16, 32, 64, 128 | 4, 4, 4, 4 | 89.78 |
| 16, 32, 64, 128 | 8, 8, 8, 8 | 89.37 |
| 16, 32, 64, 128 | 16, 16, 16, 16 | 89.18 |
| 32, 16, 8, 4 | 4, 4, 4, 4 | 89.36 |
| 32, 16, 8, 4 | 8, 8, 8, 8 | 89.29 |
| 32, 16, 8, 4 | 16, 16, 16, 16 | 88.31 |
| 64, 32, 16, 8 | 4, 4, 4, 4 | 89.89 |
| 64, 32, 16, 8 | 8, 8, 8, 8 | 88.72 |
| 64, 32, 16, 8 | 16, 16, 16, 16 | 87.97 |
| 128, 64, 32, 16 | 4, 4, 4, 4 | 90.03 |
| 128, 64, 32, 16 | 8, 8, 8, 8 | 89.96 |
| 128, 64, 32, 16 | 16, 16, 16, 16 | 89.09 |
Figure 7Impact of different CNN layers on ACC.