Literature DB >> 35225328

BERT6mA: prediction of DNA N6-methyladenine site using deep learning-based approaches.

Sho Tsukiyama1, Md Mehedi Hasan2, Hong-Wen Deng2, Hiroyuki Kurata1.   

Abstract

N6-methyladenine (6mA) is associated with important roles in DNA replication, DNA repair, transcription, regulation of gene expression. Several experimental methods were used to identify DNA modifications. However, these experimental methods are costly and time-consuming. To detect the 6mA and complement these shortcomings of experimental methods, we proposed a novel, deep leaning approach called BERT6mA. To compare the BERT6mA with other deep learning approaches, we used the benchmark datasets including 11 species. The BERT6mA presented the highest AUCs in eight species in independent tests. Furthermore, BERT6mA showed higher and comparable performance with the state-of-the-art models while the BERT6mA showed poor performances in a few species with a small sample size. To overcome this issue, pretraining and fine-tuning between two species were applied to the BERT6mA. The pretrained and fine-tuned models on specific species presented higher performances than other models even for the species with a small sample size. In addition to the prediction, we analyzed the attention weights generated by BERT6mA to reveal how the BERT6mA model extracts critical features responsible for the 6mA prediction. To facilitate biological sciences, the BERT6mA online web server and its source codes are freely accessible at https://github.com/kuratahiroyuki/BERT6mA.git, respectively.
© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  6mA modification prediction; BERT; CNN; GRU; LSTM; word2vec

Mesh:

Substances:

Year:  2022        PMID: 35225328      PMCID: PMC8921755          DOI: 10.1093/bib/bbac053

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  39 in total

1.  E. coli oriC and the dnaA gene promoter are sequestered from dam methyltransferase following the passage of the chromosomal replication fork.

Authors:  J L Campbell; N Kleckner
Journal:  Cell       Date:  1990-09-07       Impact factor: 41.582

2.  BERT4Bitter: a bidirectional encoder representations from transformers (BERT)-based model for improving the prediction of bitter peptides.

Authors:  Phasit Charoenkwan; Chanin Nantasenamat; Md Mehedi Hasan; Balachandran Manavalan; Watshara Shoombuatong
Journal:  Bioinformatics       Date:  2021-02-26       Impact factor: 6.937

3.  Detection of DNA Methylation in Genomic DNA by UHPLC-MS/MS.

Authors:  Konstantinos Boulias; Eric Lieberman Greer
Journal:  Methods Mol Biol       Date:  2021

4.  Direct detection of DNA methylation during single-molecule, real-time sequencing.

Authors:  Benjamin A Flusberg; Dale R Webster; Jessica H Lee; Kevin J Travers; Eric C Olivares; Tyson A Clark; Jonas Korlach; Stephen W Turner
Journal:  Nat Methods       Date:  2010-05-09       Impact factor: 28.547

5.  Real-time DNA sequencing from single polymerase molecules.

Authors:  John Eid; Adrian Fehr; Jeremy Gray; Khai Luong; John Lyle; Geoff Otto; Paul Peluso; David Rank; Primo Baybayan; Brad Bettman; Arkadiusz Bibillo; Keith Bjornson; Bidhan Chaudhuri; Frederick Christians; Ronald Cicero; Sonya Clark; Ravindra Dalal; Alex Dewinter; John Dixon; Mathieu Foquet; Alfred Gaertner; Paul Hardenbol; Cheryl Heiner; Kevin Hester; David Holden; Gregory Kearns; Xiangxu Kong; Ronald Kuse; Yves Lacroix; Steven Lin; Paul Lundquist; Congcong Ma; Patrick Marks; Mark Maxham; Devon Murphy; Insil Park; Thang Pham; Michael Phillips; Joy Roy; Robert Sebra; Gene Shen; Jon Sorenson; Austin Tomaney; Kevin Travers; Mark Trulson; John Vieceli; Jeffrey Wegener; Dawn Wu; Alicia Yang; Denis Zaccarin; Peter Zhao; Frank Zhong; Jonas Korlach; Stephen Turner
Journal:  Science       Date:  2008-11-20       Impact factor: 47.728

6.  Identifying antimicrobial peptides using word embedding with deep recurrent neural networks.

Authors:  Md-Nafiz Hamid; Iddo Friedberg
Journal:  Bioinformatics       Date:  2019-06-01       Impact factor: 6.937

7.  MDR: an integrative DNA N6-methyladenine and N4-methylcytosine modification database for Rosaceae.

Authors:  Zhao-Yu Liu; Jian-Feng Xing; Wei Chen; Mei-Wei Luan; Rui Xie; Jing Huang; Shang-Qian Xie; Chuan-Le Xiao
Journal:  Hortic Res       Date:  2019-06-15       Impact factor: 6.793

8.  6mA-Pred: identifying DNA N6-methyladenine sites based on deep learning.

Authors:  Qianfei Huang; Wenyang Zhou; Fei Guo; Lei Xu; Lichao Zhang
Journal:  PeerJ       Date:  2021-02-03       Impact factor: 2.984

9.  Deep6mA: A deep learning framework for exploring similar patterns in DNA N6-methyladenine sites across different species.

Authors:  Zutan Li; Hangjin Jiang; Lingpeng Kong; Yuanyuan Chen; Kun Lang; Xiaodan Fan; Liangyun Zhang; Cong Pian
Journal:  PLoS Comput Biol       Date:  2021-02-18       Impact factor: 4.475

10.  SDM6A: A Web-Based Integrative Machine-Learning Framework for Predicting 6mA Sites in the Rice Genome.

Authors:  Shaherin Basith; Balachandran Manavalan; Tae Hwan Shin; Gwang Lee
Journal:  Mol Ther Nucleic Acids       Date:  2019-08-16       Impact factor: 8.886

View more
  2 in total

Review 1.  DNA N6-Methyladenine Modification in Eukaryotic Genome.

Authors:  Hao Li; Ning Zhang; Yuechen Wang; Siyuan Xia; Yating Zhu; Chen Xing; Xuefeng Tian; Yinan Du
Journal:  Front Genet       Date:  2022-06-24       Impact factor: 4.772

2.  iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations.

Authors:  Junru Jin; Yingying Yu; Ruheng Wang; Xin Zeng; Chao Pang; Yi Jiang; Zhongshen Li; Yutong Dai; Ran Su; Quan Zou; Kenta Nakai; Leyi Wei
Journal:  Genome Biol       Date:  2022-10-17       Impact factor: 17.906

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.