Literature DB >> 36161334

iPro-WAEL: a comprehensive and robust framework for identifying promoters in multiple species.

Pengyu Zhang1,2, Hongming Zhang2, Hao Wu1.   

Abstract

Promoters are consensus DNA sequences located near the transcription start sites and they play an important role in transcription initiation. Due to their importance in biological processes, the identification of promoters is significantly important for characterizing the expression of the genes. Numerous computational methods have been proposed to predict promoters. However, it is difficult for these methods to achieve satisfactory performance in multiple species. In this study, we propose a novel weighted average ensemble learning model, termed iPro-WAEL, for identifying promoters in multiple species, including Human, Mouse, E.coli, Arabidopsis, B.amyloliquefaciens, B.subtilis and R.capsulatus. Extensive benchmarking experiments illustrate that iPro-WAEL has optimal performance and is superior to the current methods in promoter prediction. The experimental results also demonstrate a satisfactory prediction ability of iPro-WAEL on cross-cell lines, promoters annotated by other methods and distinguishing between promoters and enhancers. Moreover, we identify the most important transcription factor binding site (TFBS) motif in promoter regions to facilitate the study of identifying important motifs in the promoter regions. The source code of iPro-WAEL is freely available at https://github.com/HaoWuLab-Bioinformatics/iPro-WAEL.
© The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 36161334      PMCID: PMC9561371          DOI: 10.1093/nar/gkac824

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   19.160


  54 in total

1.  ChromHMM: automating chromatin-state discovery and characterization.

Authors:  Jason Ernst; Manolis Kellis
Journal:  Nat Methods       Date:  2012-02-28       Impact factor: 28.547

2.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors:  Weizhong Li; Adam Godzik
Journal:  Bioinformatics       Date:  2006-05-26       Impact factor: 6.937

3.  Energetic contributions to the initiation of transcription in E. coli.

Authors:  Jayanthi Ramprakash; Frederick P Schwarz
Journal:  Biophys Chem       Date:  2008-09-18       Impact factor: 2.352

4.  Transcription factors mediate long-range enhancer-promoter interactions.

Authors:  Ilias K Nolis; Daniel J McKay; Eva Mantouvalou; Stavros Lomvardas; Menie Merika; Dimitris Thanos
Journal:  Proc Natl Acad Sci U S A       Date:  2009-11-18       Impact factor: 11.205

5.  Unsupervised pattern discovery in human chromatin structure through genomic segmentation.

Authors:  Michael M Hoffman; Orion J Buske; Jie Wang; Zhiping Weng; Jeff A Bilmes; William Stafford Noble
Journal:  Nat Methods       Date:  2012-03-18       Impact factor: 28.547

6.  HOCOMOCO: towards a complete collection of transcription factor binding models for human and mouse via large-scale ChIP-Seq analysis.

Authors:  Ivan V Kulakovskiy; Ilya E Vorontsov; Ivan S Yevshin; Ruslan N Sharipov; Alla D Fedorova; Eugene I Rumynskiy; Yulia A Medvedeva; Arturo Magana-Mora; Vladimir B Bajic; Dmitry A Papatsenko; Fedor A Kolpakov; Vsevolod J Makeev
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

7.  Promotech: a general tool for bacterial promoter recognition.

Authors:  Ruben Chevez-Guardado; Lourdes Peña-Castillo
Journal:  Genome Biol       Date:  2021-11-17       Impact factor: 13.583

8.  Integrative detection and analysis of structural variation in cancer genomes.

Authors:  Jesse R Dixon; Jie Xu; Vishnu Dileep; Ye Zhan; Fan Song; Victoria T Le; Galip Gürkan Yardımcı; Abhijit Chakraborty; Darrin V Bann; Yanli Wang; Royden Clark; Lijun Zhang; Hongbo Yang; Tingting Liu; Sriranga Iyyanki; Lin An; Christopher Pool; Takayo Sasaki; Juan Carlos Rivera-Mulia; Hakan Ozadam; Bryan R Lajoie; Rajinder Kaul; Michael Buckley; Kristen Lee; Morgan Diegel; Dubravka Pezic; Christina Ernst; Suzana Hadjur; Duncan T Odom; John A Stamatoyannopoulos; James R Broach; Ross C Hardison; Ferhat Ay; William Stafford Noble; Job Dekker; David M Gilbert; Feng Yue
Journal:  Nat Genet       Date:  2018-09-10       Impact factor: 38.330

9.  Deletion of transcription factor binding motifs using the CRISPR/spCas9 system in the β-globin LCR.

Authors:  Yea Woon Kim; AeRi Kim
Journal:  Biosci Rep       Date:  2017-07-20       Impact factor: 3.840

Review 10.  Mammalian RNA polymerase II core promoters: insights from genome-wide studies.

Authors:  Albin Sandelin; Piero Carninci; Boris Lenhard; Jasmina Ponjavic; Yoshihide Hayashizaki; David A Hume
Journal:  Nat Rev Genet       Date:  2007-05-08       Impact factor: 53.242

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.