Literature DB >> 34459479

Leveraging the attention mechanism to improve the identification of DNA N6-methyladenine sites.

Ying Zhang1, Yan Liu1, Jian Xu2, Xiaoyu Wang3, Xinxin Peng3, Jiangning Song3, Dong-Jun Yu2.   

Abstract

DNA N6-methyladenine is an important type of DNA modification that plays important roles in multiple biological processes. Despite the recent progress in developing DNA 6mA site prediction methods, several challenges remain to be addressed. For example, although the hand-crafted features are interpretable, they contain redundant information that may bias the model training and have a negative impact on the trained model. Furthermore, although deep learning (DL)-based models can perform feature extraction and classification automatically, they lack the interpretability of the crucial features learned by those models. As such, considerable research efforts have been focused on achieving the trade-off between the interpretability and straightforwardness of DL neural networks. In this study, we develop two new DL-based models for improving the prediction of N6-methyladenine sites, termed LA6mA and AL6mA, which use bidirectional long short-term memory to respectively capture the long-range information and self-attention mechanism to extract the key position information from DNA sequences. The performance of the two proposed methods is benchmarked and evaluated on the two model organisms Arabidopsis thaliana and Drosophila melanogaster. On the two benchmark datasets, LA6mA achieves an area under the receiver operating characteristic curve (AUROC) value of 0.962 and 0.966, whereas AL6mA achieves an AUROC value of 0.945 and 0.941, respectively. Moreover, an in-depth analysis of the attention matrix is conducted to interpret the important information, which is hidden in the sequence and relevant for 6mA site prediction. The two novel pipelines developed for DNA 6mA site prediction in this work will facilitate a better understanding of the underlying principle of DL-based DNA methylation site prediction and its future applications.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  6mA; DNA modification; LSTM; attention interpretation; deep learning; self-attention mechanism

Mesh:

Substances:

Year:  2021        PMID: 34459479      PMCID: PMC8575024          DOI: 10.1093/bib/bbab351

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   13.994


  39 in total

1.  Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.

Authors:  Babak Alipanahi; Andrew Delong; Matthew T Weirauch; Brendan J Frey
Journal:  Nat Biotechnol       Date:  2015-07-27       Impact factor: 54.908

2.  N6-methyldeoxyadenosine marks active transcription start sites in Chlamydomonas.

Authors:  Ye Fu; Guan-Zheng Luo; Kai Chen; Xin Deng; Miao Yu; Dali Han; Ziyang Hao; Jianzhao Liu; Xingyu Lu; Louis C Dore; Xiaocheng Weng; Quanjiang Ji; Laurens Mets; Chuan He
Journal:  Cell       Date:  2015-04-30       Impact factor: 41.582

3.  MM-6mAPred: identifying DNA N6-methyladenine sites based on Markov model.

Authors:  Cong Pian; Guangle Zhang; Fei Li; Xiaodan Fan
Journal:  Bioinformatics       Date:  2020-01-15       Impact factor: 6.937

4.  i6mA-Pred: identifying DNA N6-methyladenine sites in the rice genome.

Authors:  Wei Chen; Hao Lv; Fulei Nie; Hao Lin
Journal:  Bioinformatics       Date:  2019-08-15       Impact factor: 6.937

5.  Predicting the impact of non-coding variants on DNA methylation.

Authors:  Haoyang Zeng; David K Gifford
Journal:  Nucleic Acids Res       Date:  2017-06-20       Impact factor: 16.971

Review 6.  DNA methylation in mammals.

Authors:  En Li; Yi Zhang
Journal:  Cold Spring Harb Perspect Biol       Date:  2014-05-01       Impact factor: 10.005

Review 7.  N6-methyladenine: the other methylated base of DNA.

Authors:  David Ratel; Jean-Luc Ravanat; François Berger; Didier Wion
Journal:  Bioessays       Date:  2006-03       Impact factor: 4.345

8.  A coding measure scheme employing electron-ion interaction pseudopotential (EIIP).

Authors:  Achuthsankar S Nair; Sivarama Pillai Sreenadhan
Journal:  Bioinformation       Date:  2006-10-07

9.  SDM6A: A Web-Based Integrative Machine-Learning Framework for Predicting 6mA Sites in the Rice Genome.

Authors:  Shaherin Basith; Balachandran Manavalan; Tae Hwan Shin; Gwang Lee
Journal:  Mol Ther Nucleic Acids       Date:  2019-08-16       Impact factor: 8.886

10.  Enhancing the interpretability of transcription factor binding site prediction using attention mechanism.

Authors:  Sungjoon Park; Yookyung Koh; Hwisang Jeon; Hyunjae Kim; Yoonsun Yeo; Jaewoo Kang
Journal:  Sci Rep       Date:  2020-08-07       Impact factor: 4.379

View more
  8 in total

1.  ATGPred-FL: sequence-based prediction of autophagy proteins with feature representation learning.

Authors:  Shihu Jiao; Zheng Chen; Lichao Zhang; Xun Zhou; Lei Shi
Journal:  Amino Acids       Date:  2022-03-14       Impact factor: 3.520

2.  AURKA is a prognostic potential therapeutic target in skin cutaneous melanoma modulating the tumor microenvironment, apoptosis, and hypoxia.

Authors:  ShengYong Long; Xuan Fen Zhang
Journal:  J Cancer Res Clin Oncol       Date:  2022-07-23       Impact factor: 4.322

3.  Pathogenesis and Therapeutic Targets of Focal Cortical Dysplasia Based on Bioinformatics Analysis.

Authors:  Ying Kan; Lijuan Feng; Yukun Si; Ziang Zhou; Wei Wang; Jigang Yang
Journal:  Neurochem Res       Date:  2022-08-09       Impact factor: 4.414

Review 4.  Machine learning in neuro-oncology: toward novel development fields.

Authors:  Vincenzo Di Nunno; Mario Fordellone; Giuseppe Minniti; Sofia Asioli; Alfredo Conti; Diego Mazzatenta; Damiano Balestrini; Paolo Chiodini; Raffaele Agati; Caterina Tonon; Alicia Tosoni; Lidia Gatto; Stefania Bartolini; Raffaele Lodi; Enrico Franceschi
Journal:  J Neurooncol       Date:  2022-06-28       Impact factor: 4.506

5.  Constructed the ceRNA network and predicted a FEZF1-AS1/miR-92b-3p/ZIC5 axis in colon cancer.

Authors:  Xiaoping Yang; Pingfan Wu; Zirui Wang; Xiaolu Su; Zhiping Wu; Xueni Ma; Fanqi Wu; Dekui Zhang
Journal:  Mol Cell Biochem       Date:  2022-10-11       Impact factor: 3.842

Review 6.  DNA N6-Methyladenine Modification in Eukaryotic Genome.

Authors:  Hao Li; Ning Zhang; Yuechen Wang; Siyuan Xia; Yating Zhu; Chen Xing; Xuefeng Tian; Yinan Du
Journal:  Front Genet       Date:  2022-06-24       Impact factor: 4.772

7.  Exploring the interaction mechanism between antagonist and the jasmonate receptor complex by molecular dynamics simulation.

Authors:  Mengqi Cui; Kun Zhang; Ruihan Wu; Juan Du
Journal:  J Comput Aided Mol Des       Date:  2022-01-20       Impact factor: 3.686

8.  BiLSTM-5mC: A Bidirectional Long Short-Term Memory-Based Approach for Predicting 5-Methylcytosine Sites in Genome-Wide DNA Promoters.

Authors:  Xin Cheng; Jun Wang; Qianyue Li; Taigang Liu
Journal:  Molecules       Date:  2021-12-07       Impact factor: 4.411

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.