Literature DB >> 33600435

Deep6mA: A deep learning framework for exploring similar patterns in DNA N6-methyladenine sites across different species.

Zutan Li1, Hangjin Jiang2, Lingpeng Kong1, Yuanyuan Chen1, Kun Lang3, Xiaodan Fan4, Liangyun Zhang1, Cong Pian1.   

Abstract

N6-methyladenine (6mA) is an important DNA modification form associated with a wide range of biological processes. Identifying accurately 6mA sites on a genomic scale is crucial for under-standing of 6mA's biological functions. However, the existing experimental techniques for detecting 6mA sites are cost-ineffective, which implies the great need of developing new computational methods for this problem. In this paper, we developed, without requiring any prior knowledge of 6mA and manually crafted sequence features, a deep learning framework named Deep6mA to identify DNA 6mA sites, and its performance is superior to other DNA 6mA prediction tools. Specifically, the 5-fold cross-validation on a benchmark dataset of rice gives the sensitivity and specificity of Deep6mA as 92.96% and 95.06%, respectively, and the overall prediction accuracy is 94%. Importantly, we find that the sequences with 6mA sites share similar patterns across different species. The model trained with rice data predicts well the 6mA sites of other three species: Arabidopsis thaliana, Fragaria vesca and Rosa chinensis with a prediction accuracy over 90%. In addition, we find that (1) 6mA tends to occur at GAGG motifs, which means the sequence near the 6mA site may be conservative; (2) 6mA is enriched in the TATA box of the promoter, which may be the main source of its regulating downstream gene expression.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 33600435      PMCID: PMC7924747          DOI: 10.1371/journal.pcbi.1008767

Source DB:  PubMed          Journal:  PLoS Comput Biol        ISSN: 1553-734X            Impact factor:   4.475


  33 in total

1.  Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.

Authors:  Babak Alipanahi; Andrew Delong; Matthew T Weirauch; Brendan J Frey
Journal:  Nat Biotechnol       Date:  2015-07-27       Impact factor: 54.908

2.  E. coli oriC and the dnaA gene promoter are sequestered from dam methyltransferase following the passage of the chromosomal replication fork.

Authors:  J L Campbell; N Kleckner
Journal:  Cell       Date:  1990-09-07       Impact factor: 41.582

3.  MM-6mAPred: identifying DNA N6-methyladenine sites based on Markov model.

Authors:  Cong Pian; Guangle Zhang; Fei Li; Xiaodan Fan
Journal:  Bioinformatics       Date:  2020-01-15       Impact factor: 6.937

4.  i6mA-Pred: identifying DNA N6-methyladenine sites in the rice genome.

Authors:  Wei Chen; Hao Lv; Fulei Nie; Hao Lin
Journal:  Bioinformatics       Date:  2019-08-15       Impact factor: 6.937

5.  Identification and analysis of adenine N6-methylation sites in the rice genome.

Authors:  Chao Zhou; Changshi Wang; Hongbo Liu; Qiangwei Zhou; Qian Liu; Yan Guo; Ting Peng; Jiaming Song; Jianwei Zhang; Lingling Chen; Yu Zhao; Zhixiong Zeng; Dao-Xiu Zhou
Journal:  Nat Plants       Date:  2018-07-30       Impact factor: 15.793

6.  Initiation of methyl-directed mismatch repair.

Authors:  K G Au; K Welsh; P Modrich
Journal:  J Biol Chem       Date:  1992-06-15       Impact factor: 5.157

7.  Direct detection of DNA methylation during single-molecule, real-time sequencing.

Authors:  Benjamin A Flusberg; Dale R Webster; Jessica H Lee; Kevin J Travers; Eric C Olivares; Tyson A Clark; Jonas Korlach; Stephen W Turner
Journal:  Nat Methods       Date:  2010-05-09       Impact factor: 28.547

8.  MDR: an integrative DNA N6-methyladenine and N4-methylcytosine modification database for Rosaceae.

Authors:  Zhao-Yu Liu; Jian-Feng Xing; Wei Chen; Mei-Wei Luan; Rui Xie; Jing Huang; Shang-Qian Xie; Chuan-Le Xiao
Journal:  Hortic Res       Date:  2019-06-15       Impact factor: 6.793

9.  csDMA: an improved bioinformatics tool for identifying DNA 6 mA modifications via Chou's 5-step rule.

Authors:  Ze Liu; Wei Dong; Wei Jiang; Zili He
Journal:  Sci Rep       Date:  2019-09-11       Impact factor: 4.379

10.  SDM6A: A Web-Based Integrative Machine-Learning Framework for Predicting 6mA Sites in the Rice Genome.

Authors:  Shaherin Basith; Balachandran Manavalan; Tae Hwan Shin; Gwang Lee
Journal:  Mol Ther Nucleic Acids       Date:  2019-08-16       Impact factor: 8.886

View more
  5 in total

Review 1.  Machine learning: its challenges and opportunities in plant system biology.

Authors:  Mohsen Hesami; Milad Alizadeh; Andrew Maxwell Phineas Jones; Davoud Torkamaneh
Journal:  Appl Microbiol Biotechnol       Date:  2022-05-16       Impact factor: 4.813

2.  BERT6mA: prediction of DNA N6-methyladenine site using deep learning-based approaches.

Authors:  Sho Tsukiyama; Md Mehedi Hasan; Hong-Wen Deng; Hiroyuki Kurata
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

3.  Leveraging the attention mechanism to improve the identification of DNA N6-methyladenine sites.

Authors:  Ying Zhang; Yan Liu; Jian Xu; Xiaoyu Wang; Xinxin Peng; Jiangning Song; Dong-Jun Yu
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 13.994

Review 4.  DNA N6-Methyladenine Modification in Eukaryotic Genome.

Authors:  Hao Li; Ning Zhang; Yuechen Wang; Siyuan Xia; Yating Zhu; Chen Xing; Xuefeng Tian; Yinan Du
Journal:  Front Genet       Date:  2022-06-24       Impact factor: 4.772

5.  iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations.

Authors:  Junru Jin; Yingying Yu; Ruheng Wang; Xin Zeng; Chao Pang; Yi Jiang; Zhongshen Li; Yutong Dai; Ran Su; Quan Zou; Kenta Nakai; Leyi Wei
Journal:  Genome Biol       Date:  2022-10-17       Impact factor: 17.906

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.