Literature DB >> 32778871

DeepATT: a hybrid category attention neural network for identifying functional effects of DNA sequences.

Jiawei Li1, Yuqian Pu1, Jijun Tang2, Quan Zou3, Fei Guo1.   

Abstract

Quantifying DNA properties is a challenging task in the broad field of human genomics. Since the vast majority of non-coding DNA is still poorly understood in terms of function, this task is particularly important to have enormous benefit for biology research. Various DNA sequences should have a great variety of representations, and specific functions may focus on corresponding features in the front part of learning model. Currently, however, for multi-class prediction of non-coding DNA regulatory functions, most powerful predictive models do not have appropriate feature extraction and selection approaches for specific functional effects, so that it is difficult to gain a better insight into their internal correlations. Hence, we design a category attention layer and category dense layer in order to select efficient features and distinguish different DNA functions. In this study, we propose a hybrid deep neural network method, called DeepATT, for identifying $919$ regulatory functions on nearly $5$ million DNA sequences. Our model has four built-in neural network constructions: convolution layer captures regulatory motifs, recurrent layer captures a regulatory grammar, category attention layer selects corresponding valid features for different functions and category dense layer classifies predictive labels with selected features of regulatory functions. Importantly, we compare our novel method, DeepATT, with existing outstanding prediction tools, DeepSEA and DanQ. DeepATT performs significantly better than other existing tools for identifying DNA functions, at least increasing $1.6\%$ area under precision recall. Furthermore, we can mine the important correlation among different DNA functions according to the category attention module. Moreover, our novel model can greatly reduce the number of parameters by the mechanism of attention and locally connected, on the basis of ensuring accuracy.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  DNA function; category attention; deep neural network

Year:  2021        PMID: 32778871     DOI: 10.1093/bib/bbaa159

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  9 in total

1.  SNAREs-SAP: SNARE Proteins Identification With PSSM Profiles.

Authors:  Zixiao Zhang; Yue Gong; Bo Gao; Hongfei Li; Wentao Gao; Yuming Zhao; Benzhi Dong
Journal:  Front Genet       Date:  2021-12-20       Impact factor: 4.599

2.  Pseudo-188D: Phage Protein Prediction Based on a Model of Pseudo-188D.

Authors:  Xiaomei Gu; Lina Guo; Bo Liao; Qinghua Jiang
Journal:  Front Genet       Date:  2021-12-01       Impact factor: 4.599

3.  iAIPs: Identifying Anti-Inflammatory Peptides Using Random Forest.

Authors:  Dongxu Zhao; Zhixia Teng; Yanjuan Li; Dong Chen
Journal:  Front Genet       Date:  2021-11-30       Impact factor: 4.599

4.  KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest.

Authors:  Yuran Jia; Shan Huang; Tianjiao Zhang
Journal:  Front Genet       Date:  2021-11-29       Impact factor: 4.599

5.  The Characterization of Structure and Prediction for Aquaporin in Tumour Progression by Machine Learning.

Authors:  Zheng Chen; Shihu Jiao; Da Zhao; Quan Zou; Lei Xu; Lijun Zhang; Xi Su
Journal:  Front Cell Dev Biol       Date:  2022-02-01

6.  Supervised promoter recognition: a benchmark framework.

Authors:  Raul I Perez Martell; Alison Ziesel; Hosna Jabbari; Ulrike Stege
Journal:  BMC Bioinformatics       Date:  2022-04-02       Impact factor: 3.169

7.  Identifying and Classifying Enhancers by Dinucleotide-Based Auto-Cross Covariance and Attention-Based Bi-LSTM.

Authors:  Shulin Zhao; Qingfeng Pan; Quan Zou; Ying Ju; Lei Shi; Xi Su
Journal:  Comput Math Methods Med       Date:  2022-04-05       Impact factor: 2.238

8.  Identification of plant vacuole proteins by exploiting deep representation learning features.

Authors:  Shihu Jiao; Quan Zou
Journal:  Comput Struct Biotechnol J       Date:  2022-06-08       Impact factor: 6.155

9.  scEpiLock: A Weakly Supervised Learning Framework for cis-Regulatory Element Localization and Variant Impact Quantification for Single-Cell Epigenetic Data.

Authors:  Yanwen Gong; Shushrruth Sai Srinivasan; Ruiyi Zhang; Kai Kessenbrock; Jing Zhang
Journal:  Biomolecules       Date:  2022-06-23
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.