Literature DB >> 33672576

4mCPred-CNN-Prediction of DNA N4-Methylcytosine in the Mouse Genome Using a Convolutional Neural Network.

Zeeshan Abbas1,2, Hilal Tayara3, Kil To Chong1,4.   

Abstract

Among DNA modifications, N4-methylcytosine (4mC) is one of the most significant ones, and it is linked to the development of cell proliferation and gene expression. To know different its biological functions, the accurate detection of 4mC sites is required. Although we have several techniques for the prediction of 4mC sites in different genomes based on both machine learning (ML) and convolutional neural networks (CNNs), there is no CNN-based tool for the identification of 4mC sites in the mouse genome. In this article, a CNN-based model named 4mCPred-CNN was developed to classify 4mC locations in the mouse genome. Until now, we had only two ML-based models for this purpose; they utilized several feature encoding schemes, and thus still had a lot of space available to improve the prediction accuracy. Utilizing only a single feature encoding scheme-one-hot encoding-we outperformed both of the previous ML-based techniques. In a ten-fold validation test, the proposed model, 4mCPred-CNN, achieved an accuracy of 85.71% and Matthews correlation coefficient (MCC) of 0.717. On an independent dataset, the achieved accuracy was 87.50% with an MCC value of 0.750. The attained results exhibit that the proposed model can be of great use for researchers in the fields of biology and bioinformatics.

Entities:  

Keywords:  N4-methylcytosine; computational biology; epigenetics; neural networks

Mesh:

Substances:

Year:  2021        PMID: 33672576      PMCID: PMC7924022          DOI: 10.3390/genes12020296

Source DB:  PubMed          Journal:  Genes (Basel)        ISSN: 2073-4425            Impact factor:   4.096


  34 in total

Review 1.  Mass spectrometry and protein analysis.

Authors:  Bruno Domon; Ruedi Aebersold
Journal:  Science       Date:  2006-04-14       Impact factor: 47.728

2.  iLearn: an integrated platform and meta-learner for feature engineering, machine-learning analysis and modeling of DNA, RNA and protein sequence data.

Authors:  Zhen Chen; Pei Zhao; Fuyi Li; Tatiana T Marquez-Lago; André Leier; Jerico Revote; Yan Zhu; David R Powell; Tatsuya Akutsu; Geoffrey I Webb; Kuo-Chen Chou; A Ian Smith; Roger J Daly; Jian Li; Jiangning Song
Journal:  Brief Bioinform       Date:  2020-05-21       Impact factor: 11.622

Review 3.  A Survey for Predicting Enzyme Family Classes Using Machine Learning Methods.

Authors:  Jiu-Xin Tan; Hao Lv; Fang Wang; Fu-Ying Dao; Wei Chen; Hui Ding
Journal:  Curr Drug Targets       Date:  2019       Impact factor: 3.465

Review 4.  New concepts in DNA methylation.

Authors:  Albert Jeltsch; Renata Z Jurkowska
Journal:  Trends Biochem Sci       Date:  2014-06-16       Impact factor: 13.807

5.  SOMM4mC: a second-order Markov model for DNA N4-methylcytosine site prediction in six species.

Authors:  Jiali Yang; Kun Lang; Guangle Zhang; Xiaodan Fan; Yuanyuan Chen; Cong Pian
Journal:  Bioinformatics       Date:  2020-08-15       Impact factor: 6.937

6.  DNA methylation on N(6)-adenine in mammalian embryonic stem cells.

Authors:  Tao P Wu; Tao Wang; Matthew G Seetin; Yongquan Lai; Shijia Zhu; Kaixuan Lin; Yifei Liu; Stephanie D Byrum; Samuel G Mackintosh; Mei Zhong; Alan Tackett; Guilin Wang; Lawrence S Hon; Gang Fang; James A Swenberg; Andrew Z Xiao
Journal:  Nature       Date:  2016-03-30       Impact factor: 49.962

Review 7.  A primer on deep learning in genomics.

Authors:  James Zou; Mikael Huss; Abubakar Abid; Pejman Mohammadi; Ali Torkamani; Amalio Telenti
Journal:  Nat Genet       Date:  2018-11-26       Impact factor: 38.330

8.  Abundant DNA 6mA methylation during early embryogenesis of zebrafish and pig.

Authors:  Jianzhao Liu; Yuanxiang Zhu; Guan-Zheng Luo; Xinxia Wang; Yanan Yue; Xiaona Wang; Xin Zong; Kai Chen; Hang Yin; Ye Fu; Dali Han; Yizhen Wang; Dahua Chen; Chuan He
Journal:  Nat Commun       Date:  2016-10-07       Impact factor: 14.919

Review 9.  Single molecule real-time (SMRT) sequencing comes of age: applications and utilities for medical diagnostics.

Authors:  Simon Ardui; Adam Ameur; Joris R Vermeesch; Matthew S Hestand
Journal:  Nucleic Acids Res       Date:  2018-03-16       Impact factor: 16.971

10.  iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice.

Authors:  Hao Lv; Fu-Ying Dao; Zheng-Xing Guan; Dan Zhang; Jiu-Xin Tan; Yong Zhang; Wei Chen; Hao Lin
Journal:  Front Genet       Date:  2019-09-10       Impact factor: 4.599

View more
  1 in total

1.  Systematic Analysis and Accurate Identification of DNA N4-Methylcytosine Sites by Deep Learning.

Authors:  Lezheng Yu; Yonglin Zhang; Li Xue; Fengjuan Liu; Qi Chen; Jiesi Luo; Runyu Jing
Journal:  Front Microbiol       Date:  2022-03-15       Impact factor: 5.640

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.