Literature DB >> 33978703

CrepHAN: Cross-species prediction of enhancers by using hierarchical attention networks.

Jianwei Hong1,2, Ruitian Gao3, Yang Yang1,4.   

Abstract

MOTIVATION: Enhancers are important functional elements in genome sequences. The identification of enhancers is a very challenging task due to the great diversity of enhancer sequences and the flexible localization on genomes. Till now, the interactions between enhancers and genes have not been fully understood yet. To speed up the studies of the regulatory roles of enhancers, computational tools for the prediction of enhancers have emerged in recent years. Especially, thanks to the ENCODE project and the advances of high-throughput experimental techniques, a large amount of experimentally verified enhancers have been annotated on the human genome, which allows large-scale predictions of unknown enhancers using data-driven methods. However, except for human and some model organisms, the validated enhancer annotations are scarce for most species, leading to more difficulties in the computational identification of enhancers for their genomes.
RESULTS: In this study, we propose a deep learning-based predictor for enhancers, named CrepHAN, which is featured by a hierarchical attention neural network and word embedding-based representations for DNA sequences. We use the experimentally-supported data of the human genome to train the model, and perform experiments on human and other mammals, including mouse, cow, and dog. The experimental results show that CrepHAN has more advantages on cross-species predictions, and outperforms the existing models by a large margin. Especially, for human-mouse cross-predictions, the AUC score of ROC curve is increased by 0.033∼0.145 on the combined tissue dataset and 0.032∼0.109 on tissue-specific datasets. AVAILABILITY: bcmi.sjtu.edu.cn/~yangyang/CrepHAN.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) (2021). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Year:  2021        PMID: 33978703     DOI: 10.1093/bioinformatics/btab349

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  2 in total

1.  Comprehensive Genomic Discovery of Non-Coding Transcriptional Enhancers in the African Malaria Vector Anopheles coluzzii.

Authors:  Inge Holm; Luisa Nardini; Adrien Pain; Emmanuel Bischoff; Cameron E Anderson; Soumanaba Zongo; Wamdaogo M Guelbeogo; N'Fale Sagnon; Daryl M Gohl; Ronald J Nowling; Kenneth D Vernick; Michelle M Riehle
Journal:  Front Genet       Date:  2022-01-10       Impact factor: 4.772

Review 2.  From shallow to deep: some lessons learned from application of machine learning for recognition of functional genomic elements in human genome.

Authors:  Boris Jankovic; Takashi Gojobori
Journal:  Hum Genomics       Date:  2022-02-18       Impact factor: 4.639

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.