Literature DB >> 32413127

SOMM4mC: a second-order Markov model for DNA N4-methylcytosine site prediction in six species.

Jiali Yang1,2, Kun Lang3, Guangle Zhang4, Xiaodan Fan5, Yuanyuan Chen1, Cong Pian1,5.   

Abstract

MOTIVATION: DNA N4-methylcytosine (4mC) modification is an important epigenetic modification in prokaryotic DNA due to its role in regulating DNA replication and protecting the host DNA against degradation. An efficient algorithm to identify 4mC sites is needed for downstream analyses.
RESULTS: In this study, we propose a new prediction method named SOMM4mC based on a second-order Markov model, which makes use of the transition probability between adjacent nucleotides to identify 4mC sites. The results show that the first-order and second-order Markov model are superior to the three existing algorithms in all six species (Caenorhabditis elegans, Drosophila melanogaster, Arabidopsis thaliana, Escherichia coli, Geoalkalibacter subterruneus and Geobacter pickeringii) where benchmark datasets are available. However, the classification performance of SOMM4mC is more outstanding than that of first-order Markov model. Especially, for E.coli and C.elegans, the overall accuracy of SOMM4mC are 91.8% and 87.6%, which are 8.5% and 6.1% higher than those of the latest method 4mcPred-SVM, respectively. This shows that more discriminant sequence information is captured by SOMM4mC through the dependency between adjacent nucleotides.
AVAILABILITY AND IMPLEMENTATION: The web server of SOMM4mC is freely accessible at www.insect-genome.com/SOMM4mC. CONTACT: chenyuanyuan@njau.edu.cn or piancong@njau.edu.cn.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Year:  2020        PMID: 32413127     DOI: 10.1093/bioinformatics/btaa507

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  i4mC-Deep: An Intelligent Predictor of N4-Methylcytosine Sites Using a Deep Learning Approach with Chemical Properties.

Authors:  Waleed Alam; Hilal Tayara; Kil To Chong
Journal:  Genes (Basel)       Date:  2021-07-23       Impact factor: 4.096

2.  4mCPred-CNN-Prediction of DNA N4-Methylcytosine in the Mouse Genome Using a Convolutional Neural Network.

Authors:  Zeeshan Abbas; Hilal Tayara; Kil To Chong
Journal:  Genes (Basel)       Date:  2021-02-20       Impact factor: 4.096

3.  Identifying RNA N6-Methyladenine Sites in Three Species Based on a Markov Model.

Authors:  Cong Pian; Zhixin Yang; Yuqian Yang; Liangyun Zhang; Yuanyuan Chen
Journal:  Front Genet       Date:  2021-03-19       Impact factor: 4.599

4.  Identifying DNA N4-methylcytosine sites in the rosaceae genome with a deep learning model relying on distributed feature representation.

Authors:  Jhabindra Khanal; Hilal Tayara; Quan Zou; Kil To Chong
Journal:  Comput Struct Biotechnol J       Date:  2021-03-19       Impact factor: 7.271

5.  Systematic Analysis and Accurate Identification of DNA N4-Methylcytosine Sites by Deep Learning.

Authors:  Lezheng Yu; Yonglin Zhang; Li Xue; Fengjuan Liu; Qi Chen; Jiesi Luo; Runyu Jing
Journal:  Front Microbiol       Date:  2022-03-15       Impact factor: 5.640

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.