Literature DB >> 34664074

MAResNet: predicting transcription factor binding sites by combining multi-scale bottom-up and top-down attention and residual network.

Ke Han1, Long-Chen Shen1, Yi-Heng Zhu1, Jian Xu1, Jiangning Song2,3, Dong-Jun Yu1.   

Abstract

Accurate identification of transcription factor binding sites is of great significance in understanding gene expression, biological development and drug design. Although a variety of methods based on deep-learning models and large-scale data have been developed to predict transcription factor binding sites in DNA sequences, there is room for further improvement in prediction performance. In addition, effective interpretation of deep-learning models is greatly desirable. Here we present MAResNet, a new deep-learning method, for predicting transcription factor binding sites on 690 ChIP-seq datasets. More specifically, MAResNet combines the bottom-up and top-down attention mechanisms and a state-of-the-art feed-forward network (ResNet), which is constructed by stacking attention modules that generate attention-aware features. In particular, the multi-scale attention mechanism is utilized at the first stage to extract rich and representative sequence features. We further discuss the attention-aware features learned from different attention modules in accordance with the changes as the layers go deeper. The features learned by MAResNet are also visualized through the TMAP tool to illustrate that the method can extract the unique characteristics of transcription factor binding sites. The performance of MAResNet is extensively tested on 690 test subsets with an average AUC of 0.927, which is higher than that of the current state-of-the-art methods. Overall, this study provides a new and useful framework for the prediction of transcription factor binding sites by combining the funnel attention modules with the residual network.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  deep learning; multi-scale bottom-up and top-down attention; residual network; sequence analysis; transcription factor binding site

Mesh:

Substances:

Year:  2022        PMID: 34664074      PMCID: PMC8769703          DOI: 10.1093/bib/bbab445

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   13.994


  37 in total

1.  Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.

Authors:  Babak Alipanahi; Andrew Delong; Matthew T Weirauch; Brendan J Frey
Journal:  Nat Biotechnol       Date:  2015-07-27       Impact factor: 54.908

2.  Transcription factors-DNA interactions in rice: identification and verification.

Authors:  Zijie Shen; Yuan Lin; Quan Zou
Journal:  Brief Bioinform       Date:  2020-05-21       Impact factor: 11.622

3.  gkmSVM: an R package for gapped-kmer SVM.

Authors:  Mahmoud Ghandi; Morteza Mohammad-Noori; Narges Ghareghani; Dongwon Lee; Levi Garraway; Michael A Beer
Journal:  Bioinformatics       Date:  2016-04-19       Impact factor: 6.937

4.  Assessing computational tools for the discovery of transcription factor binding sites.

Authors:  Martin Tompa; Nan Li; Timothy L Bailey; George M Church; Bart De Moor; Eleazar Eskin; Alexander V Favorov; Martin C Frith; Yutao Fu; W James Kent; Vsevolod J Makeev; Andrei A Mironov; William Stafford Noble; Giulio Pavesi; Graziano Pesole; Mireille Régnier; Nicolas Simonis; Saurabh Sinha; Gert Thijs; Jacques van Helden; Mathias Vandenbogaert; Zhiping Weng; Christopher Workman; Chun Ye; Zhou Zhu
Journal:  Nat Biotechnol       Date:  2005-01       Impact factor: 54.908

5.  Comprehensive evaluation of deep learning architectures for prediction of DNA/RNA sequence binding specificities.

Authors:  Ameni Trabelsi; Mohamed Chaabane; Asa Ben-Hur
Journal:  Bioinformatics       Date:  2019-07-15       Impact factor: 6.937

6.  Visualization of very large high-dimensional data sets as minimum spanning trees.

Authors:  Daniel Probst; Jean-Louis Reymond
Journal:  J Cheminform       Date:  2020-02-12       Impact factor: 5.514

7.  Protein transfer learning improves identification of heat shock protein families.

Authors:  Seonwoo Min; HyunGi Kim; Byunghan Lee; Sungroh Yoon
Journal:  PLoS One       Date:  2021-05-18       Impact factor: 3.240

8.  CD-HIT Suite: a web server for clustering and comparing biological sequences.

Authors:  Ying Huang; Beifang Niu; Ying Gao; Limin Fu; Weizhong Li
Journal:  Bioinformatics       Date:  2010-01-06       Impact factor: 6.937

9.  DNA dynamics play a role as a basal transcription factor in the positioning and regulation of gene transcription initiation.

Authors:  Boian S Alexandrov; Vladimir Gelev; Sang Wook Yoo; Ludmil B Alexandrov; Yayoi Fukuyo; Alan R Bishop; Kim Ø Rasmussen; Anny Usheva
Journal:  Nucleic Acids Res       Date:  2009-12-17       Impact factor: 16.971

10.  TFBSTools: an R/bioconductor package for transcription factor binding site analysis.

Authors:  Ge Tan; Boris Lenhard
Journal:  Bioinformatics       Date:  2016-01-21       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.