Promoter analysis and prediction in the human genome using sequence-based deep learning models.

Ramzan Umarov, Hiroyuki Kuwahara, Yu Li, Xin Gao, Victor Solovyev.

Abstract

MOTIVATION: Computational identification of promoters is notoriously difficult because human genes often have unique promoter sequences that regulate transcription and interact with the transcription initiation complex. Although there have been many attempts to develop computational promoter identification methods, we have no reliable tool for analyzing long genomic sequences.
RESULTS: In this work, we further develop our deep learning approach, which was relatively successful at discriminating short promoter and non-promoter sequences. Instead of focusing on classification accuracy, we predict the exact positions of transcription start sites inside genomic sequences by testing every possible location. We studied human promoters to find effective regions for discrimination and built corresponding deep learning models. These models use an adaptively constructed negative set, which iteratively improves the model's discriminative ability. Our method significantly outperforms previously developed promoter prediction programs by considerably reducing the number of false-positive predictions. We achieved an error-per-1000-bp rate of 0.02 and 0.31 errors per correct prediction, which is significantly better than the results of other human promoter predictors.
AVAILABILITY AND IMPLEMENTATION: The developed method is available as a web server at http://www.cbrc.kaust.edu.sa/PromID/.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
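
The abstract describes testing every possible location in a genomic sequence and reports two error figures (errors per 1000 bp and errors per correct prediction). The following is a minimal Python sketch of how such a scan and such figures could be computed. It is not the authors' implementation: `toy_score` is a hypothetical stand-in for the deep learning model, the function names, `threshold`, `min_gap` and `tolerance` are illustrative, and treating an "error" as a prediction with no annotated start site within `tolerance` bp is an assumption.

```python
from typing import Callable, List, Tuple


def scan_positions(seq: str, score: Callable[[str, int], float],
                   threshold: float, min_gap: int) -> List[int]:
    """Score every position; report the best position of each cluster of hits."""
    predictions: List[int] = []
    cluster: List[Tuple[float, int]] = []  # (score, position) of nearby hits
    for pos in range(len(seq)):
        s = score(seq, pos)
        if s < threshold:
            continue
        # A hit farther than min_gap from the previous hit starts a new cluster.
        if cluster and pos - cluster[-1][1] > min_gap:
            predictions.append(max(cluster)[1])
            cluster = []
        cluster.append((s, pos))
    if cluster:
        predictions.append(max(cluster)[1])
    return predictions


def error_rates(predicted: List[int], true_tss: List[int],
                seq_len: int, tolerance: int) -> Tuple[float, float]:
    """Return (errors per 1000 bp, errors per correct prediction).

    Assumption: an error is a prediction with no unmatched annotated
    start site within `tolerance` bp.
    """
    matched = set()
    false_pos = 0
    for p in predicted:
        hits = [t for t in true_tss
                if abs(t - p) <= tolerance and t not in matched]
        if hits:
            matched.add(min(hits, key=lambda t: abs(t - p)))
        else:
            false_pos += 1
    correct = len(matched)
    per_kb = 1000.0 * false_pos / seq_len
    per_correct = false_pos / correct if correct else float("inf")
    return per_kb, per_correct


def toy_score(seq: str, pos: int) -> float:
    """Hypothetical scorer: 1.0 if a TATAAA motif starts 30 bp upstream."""
    if pos < 30:
        return 0.0
    return 1.0 if seq[pos - 30:pos - 24] == "TATAAA" else 0.0


# Toy 2000-bp sequence with one real promoter motif and one decoy motif.
bases = list("G" * 2000)
for start in (470, 1170):
    bases[start:start + 6] = "TATAAA"
sequence = "".join(bases)

predicted = scan_positions(sequence, toy_score, threshold=0.5, min_gap=50)
per_kb, per_correct = error_rates(predicted, true_tss=[500],
                                  seq_len=len(sequence), tolerance=50)
print(predicted, per_kb, per_correct)  # [500, 1200] 0.5 1.0
```

In this toy run the decoy at position 1200 is the only error, giving 0.5 errors per 1000 bp and 1.0 errors per correct prediction. The figures reported in the abstract (0.02 and 0.31) come from the authors' own evaluation and may use a different matching rule.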

Year:  2019        PMID: 30601980     DOI: 10.1093/bioinformatics/bty1068

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  20 in total

1.  A deep dense inception network for protein beta-turn prediction.

Authors:  Chao Fang; Yi Shang; Dong Xu
Journal:  Proteins       Date:  2019-07-23

2.  DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites.

Authors:  Fuyi Li; Jinxiang Chen; André Leier; Tatiana Marquez-Lago; Quanzhong Liu; Yanze Wang; Jerico Revote; A Ian Smith; Tatsuya Akutsu; Geoffrey I Webb; Lukasz Kurgan; Jiangning Song
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

3.  Critical assessment of computational tools for prokaryotic and eukaryotic promoter prediction.

Authors:  Meng Zhang; Cangzhi Jia; Fuyi Li; Chen Li; Yan Zhu; Tatsuya Akutsu; Geoffrey I Webb; Quan Zou; Lachlan J M Coin; Jiangning Song
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

4.  Protein-RNA interaction prediction with deep learning: structure matters.

Authors:  Junkang Wei; Siyuan Chen; Licheng Zong; Xin Gao; Yu Li
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

5.  Integrating convolution and self-attention improves language model of human genome for interpreting non-coding regions at base-resolution.

Authors:  Meng Yang; Lichao Huang; Haiping Huang; Hui Tang; Nan Zhang; Huanming Yang; Jihong Wu; Feng Mu
Journal:  Nucleic Acids Res       Date:  2022-08-12       Impact factor: 19.160

6.  Investigating the Genomic Background of CRISPR-Cas Genomes for CRISPR-Based Antimicrobials.

Authors:  Hyunjin Shim
Journal:  Evol Bioinform Online       Date:  2022-06-08       Impact factor: 2.031

7.  Precise Prediction of Calpain Cleavage Sites and Their Aberrance Caused by Mutations in Cancer.

Authors:  Ze-Xian Liu; Kai Yu; Jingsi Dong; Linhong Zhao; Zekun Liu; Qingfeng Zhang; Shihua Li; Yimeng Du; Han Cheng
Journal:  Front Genet       Date:  2019-08-08       Impact factor: 4.599

8.  Model-driven generation of artificial yeast promoters.

Authors:  Benjamin J Kotopka; Christina D Smolke
Journal:  Nat Commun       Date:  2020-04-30       Impact factor: 14.919

9.  Identification of Regulatory SNPs Associated with Vicine and Convicine Content of Vicia faba Based on Genotyping by Sequencing Data Using Deep Learning.

Authors:  Felix Heinrich; Martin Wutke; Pronaya Prosun Das; Miriam Kamp; Mehmet Gültas; Wolfgang Link; Armin Otto Schmitt
Journal:  Genes (Basel)       Date:  2020-06-05       Impact factor: 4.096

10.  SpliceFinder: ab initio prediction of splice sites using convolutional neural network.

Authors:  Ruohan Wang; Zishuai Wang; Jianping Wang; Shuaicheng Li
Journal:  BMC Bioinformatics       Date:  2019-12-27       Impact factor: 3.169
