Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Promoter analysis and prediction in the human genome using sequence-based deep learning models.

Literature DB >> 30601980

Promoter analysis and prediction in the human genome using sequence-based deep learning models.

Ramzan Umarov¹, Hiroyuki Kuwahara¹, Yu Li¹, Xin Gao¹, Victor Solovyev².

Abstract

MOTIVATION: Computational identification of promoters is notoriously difficult as human genes often have unique promoter sequences that provide regulation of transcription and interaction with transcription initiation complex. While there are many attempts to develop computational promoter identification methods, we have no reliable tool to analyze long genomic sequences.
RESULTS: In this work, we further develop our deep learning approach that was relatively successful to discriminate short promoter and non-promoter sequences. Instead of focusing on the classification accuracy, in this work we predict the exact positions of the transcription start site inside the genomic sequences testing every possible location. We studied human promoters to find effective regions for discrimination and built corresponding deep learning models. These models use adaptively constructed negative set, which iteratively improves the model's discriminative ability. Our method significantly outperforms the previously developed promoter prediction programs by considerably reducing the number of false-positive predictions. We have achieved error-per-1000-bp rate of 0.02 and have 0.31 errors per correct prediction, which is significantly better than the results of other human promoter predictors.
AVAILABILITY AND IMPLEMENTATION: The developed method is available as a web server at http://www.cbrc.kaust.edu.sa/PromID/.

Entities: Species

Mesh：

Year: 2019 PMID： 30601980 DOI： 10.1093/bioinformatics/bty1068

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

20 in total

1. A deep dense inception network for protein beta-turn prediction.

Authors: Chao Fang; Yi Shang; Dong Xu
Journal: Proteins Date: 2019-07-23

2. DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites.

Authors: Fuyi Li; Jinxiang Chen; André Leier; Tatiana Marquez-Lago; Quanzhong Liu; Yanze Wang; Jerico Revote; A Ian Smith; Tatsuya Akutsu; Geoffrey I Webb; Lukasz Kurgan; Jiangning Song
Journal: Bioinformatics Date: 2020-02-15 Impact factor: 6.937

3. Critical assessment of computational tools for prokaryotic and eukaryotic promoter prediction.

Authors: Meng Zhang; Cangzhi Jia; Fuyi Li; Chen Li; Yan Zhu; Tatsuya Akutsu; Geoffrey I Webb; Quan Zou; Lachlan J M Coin; Jiangning Song
Journal: Brief Bioinform Date: 2022-03-10 Impact factor: 11.622

4. Protein-RNA interaction prediction with deep learning: structure matters.

Authors: Junkang Wei; Siyuan Chen; Licheng Zong; Xin Gao; Yu Li
Journal: Brief Bioinform Date: 2022-01-17 Impact factor: 11.622

5. Integrating convolution and self-attention improves language model of human genome for interpreting non-coding regions at base-resolution.

Authors: Meng Yang; Lichao Huang; Haiping Huang; Hui Tang; Nan Zhang; Huanming Yang; Jihong Wu; Feng Mu
Journal: Nucleic Acids Res Date: 2022-08-12 Impact factor: 19.160

6. Investigating the Genomic Background of CRISPR-Cas Genomes for CRISPR-Based Antimicrobials.

Authors: Hyunjin Shim
Journal: Evol Bioinform Online Date: 2022-06-08 Impact factor: 2.031

7. Precise Prediction of Calpain Cleavage Sites and Their Aberrance Caused by Mutations in Cancer.

Authors: Ze-Xian Liu; Kai Yu; Jingsi Dong; Linhong Zhao; Zekun Liu; Qingfeng Zhang; Shihua Li; Yimeng Du; Han Cheng
Journal: Front Genet Date: 2019-08-08 Impact factor: 4.599

8. Model-driven generation of artificial yeast promoters.

Authors: Benjamin J Kotopka; Christina D Smolke
Journal: Nat Commun Date: 2020-04-30 Impact factor: 14.919

9. Identification of Regulatory SNPs Associated with Vicine and Convicine Content of Vicia faba Based on Genotyping by Sequencing Data Using Deep Learning.

Authors: Felix Heinrich; Martin Wutke; Pronaya Prosun Das; Miriam Kamp; Mehmet Gültas; Wolfgang Link; Armin Otto Schmitt
Journal: Genes (Basel) Date: 2020-06-05 Impact factor: 4.096

10. SpliceFinder: ab initio prediction of splice sites using convolutional neural network.

Authors: Ruohan Wang; Zishuai Wang; Jianping Wang; Shuaicheng Li
Journal: BMC Bioinformatics Date: 2019-12-27 Impact factor: 3.169