Bite Yang1, Feng Liu1,2, Chao Ren1, Zhangyi Ouyang1, Ziwei Xie3, Xiaochen Bo1, Wenjie Shu1. 1. Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing 100850. 2. Department of Information, The 188th Hospital of Chaozhou, Chaozhou 521000. 3. Department of Biomedical Engineering, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China.
Abstract
MOTIVATION: Enhancer elements are noncoding stretches of DNA that play key roles in controlling gene expression programmes. Despite major efforts to develop accurate enhancer prediction methods, identifying enhancer sequences continues to be a challenge in the annotation of mammalian genomes. One of the major issues is the lack of large, sufficiently comprehensive and experimentally validated enhancers for humans or other species. Thus, the development of computational methods based on limited experimentally validated enhancers and deciphering the transcriptional regulatory code encoded in the enhancer sequences is urgent. RESULTS: We present a deep-learning-based hybrid architecture, BiRen, which predicts enhancers using the DNA sequence alone. Our results demonstrate that BiRen can learn common enhancer patterns directly from the DNA sequence and exhibits superior accuracy, robustness and generalizability in enhancer prediction relative to other state-of-the-art enhancer predictors based on sequence characteristics. Our BiRen will enable researchers to acquire a deeper understanding of the regulatory code of enhancer sequences. AVAILABILITY AND IMPLEMENTATION: Our BiRen method can be freely accessed at https://github.com/wenjiegroup/BiRen . CONTACT: shuwj@bmi.ac.cn or boxc@bmi.ac.cn. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Enhancer elements are noncoding stretches of DNA that play key roles in controlling gene expression programmes. Despite major efforts to develop accurate enhancer prediction methods, identifying enhancer sequences continues to be a challenge in the annotation of mammalian genomes. One of the major issues is the lack of large, sufficiently comprehensive and experimentally validated enhancers for humans or other species. Thus, the development of computational methods based on limited experimentally validated enhancers and deciphering the transcriptional regulatory code encoded in the enhancer sequences is urgent. RESULTS: We present a deep-learning-based hybrid architecture, BiRen, which predicts enhancers using the DNA sequence alone. Our results demonstrate that BiRen can learn common enhancer patterns directly from the DNA sequence and exhibits superior accuracy, robustness and generalizability in enhancer prediction relative to other state-of-the-art enhancer predictors based on sequence characteristics. Our BiRen will enable researchers to acquire a deeper understanding of the regulatory code of enhancer sequences. AVAILABILITY AND IMPLEMENTATION: Our BiRen method can be freely accessed at https://github.com/wenjiegroup/BiRen . CONTACT: shuwj@bmi.ac.cn or boxc@bmi.ac.cn. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Muhammad Nabeel Asim; Muhammad Ali Ibrahim; Christoph Zehe; Johan Trygg; Andreas Dengel; Sheraz Ahmed Journal: Interdiscip Sci Date: 2022-08-10 Impact factor: 3.492