Literature DB >> 29617876

A novel method for improved accuracy of transcription factor binding site prediction.

Abdullah M Khamis1, Olaa Motwalli1, Romina Oliva1,2, Boris R Jankovic1, Yulia A Medvedeva1,3,4,5, Haitham Ashoor1, Magbubah Essack1, Xin Gao1, Vladimir B Bajic1.   

Abstract

Identifying transcription factor (TF) binding sites (TFBSs) is important in the computational inference of gene regulation. Widely used computational methods of TFBS prediction based on position weight matrices (PWMs) usually have high false positive rates. Moreover, computational studies of transcription regulation in eukaryotes frequently require numerous PWM models of TFBSs due to a large number of TFs involved. To overcome these problems we developed DRAF, a novel method for TFBS prediction that requires only 14 prediction models for 232 human TFs, while at the same time significantly improves prediction accuracy. DRAF models use more features than PWM models, as they combine information from TFBS sequences and physicochemical properties of TF DNA-binding domains into machine learning models. Evaluation of DRAF on 98 human ChIP-seq datasets shows on average 1.54-, 1.96- and 5.19-fold reduction of false positives at the same sensitivities compared to models from HOCOMOCO, TRANSFAC and DeepBind, respectively. This observation suggests that one can efficiently replace the PWM models for TFBS prediction by a small number of DRAF models that significantly improve prediction accuracy. The DRAF method is implemented in a web tool and in a stand-alone software freely available at http://cbrc.kaust.edu.sa/DRAF.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29617876      PMCID: PMC6037060          DOI: 10.1093/nar/gky237

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  52 in total

Review 1.  DNA binding sites: representation and discovery.

Authors:  G D Stormo
Journal:  Bioinformatics       Date:  2000-01       Impact factor: 6.937

2.  Identification and functional modelling of DNA sequence elements of transcription.

Authors:  T Werner
Journal:  Brief Bioinform       Date:  2000-11       Impact factor: 11.622

3.  Identification of transcription factor binding sites with variable-order Bayesian networks.

Authors:  I Ben-Gal; A Shani; A Gohr; J Grau; S Arviv; A Shmilovici; S Posch; I Grosse
Journal:  Bioinformatics       Date:  2005-03-29       Impact factor: 6.937

4.  A novel computational method to predict transcription factor DNA binding preference.

Authors:  Ziliang Qian; Yu-Dong Cai; Yixue Li
Journal:  Biochem Biophys Res Commun       Date:  2006-08-01       Impact factor: 3.575

5.  A novel computational approach to predict transcription factor DNA binding preference.

Authors:  Yudong Cai; Jianfeng He; Xinlei Li; Lin Lu; Xinyi Yang; Kaiyan Feng; Wencong Lu; Xiangyin Kong
Journal:  J Proteome Res       Date:  2009-02       Impact factor: 4.466

6.  Varying levels of complexity in transcription factor binding motifs.

Authors:  Jens Keilwagen; Jan Grau
Journal:  Nucleic Acids Res       Date:  2015-06-26       Impact factor: 16.971

7.  Creating PWMs of transcription factors using 3D structure-based computation of protein-DNA free binding energies.

Authors:  Denitsa Alamanova; Philip Stegmaier; Alexander Kel
Journal:  BMC Bioinformatics       Date:  2010-05-03       Impact factor: 3.169

8.  Tree-based position weight matrix approach to model transcription factor binding site profiles.

Authors:  Yingtao Bi; Hyunsoo Kim; Ravi Gupta; Ramana V Davuluri
Journal:  PLoS One       Date:  2011-09-02       Impact factor: 3.240

9.  Improved regulatory element prediction based on tissue-specific local epigenomic signatures.

Authors:  Yupeng He; David U Gorkin; Diane E Dickel; Joseph R Nery; Rosa G Castanon; Ah Young Lee; Yin Shen; Axel Visel; Len A Pennacchio; Bing Ren; Joseph R Ecker
Journal:  Proc Natl Acad Sci U S A       Date:  2017-02-13       Impact factor: 11.205

10.  JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles.

Authors:  Anthony Mathelier; Oriol Fornes; David J Arenillas; Chih-Yu Chen; Grégoire Denay; Jessica Lee; Wenqiang Shi; Casper Shyr; Ge Tan; Rebecca Worsley-Hunt; Allen W Zhang; François Parcy; Boris Lenhard; Albin Sandelin; Wyeth W Wasserman
Journal:  Nucleic Acids Res       Date:  2015-11-03       Impact factor: 16.971

View more
  5 in total

1.  DNA Motif Recognition Modeling from Protein Sequences.

Authors:  Ka-Chun Wong
Journal:  iScience       Date:  2018-09-10

2.  CiiiDER: A tool for predicting and analysing transcription factor binding sites.

Authors:  Linden J Gearing; Helen E Cumming; Ross Chapman; Alexander M Finkel; Isaac B Woodhouse; Kevin Luu; Jodee A Gould; Samuel C Forster; Paul J Hertzog
Journal:  PLoS One       Date:  2019-09-04       Impact factor: 3.240

3.  Whole-genome sequencing reveals insights into the adaptation of French Charolais cattle to Cuban tropical conditions.

Authors:  Lino C Ramírez-Ayala; Dominique Rocha; Sebas E Ramos-Onsins; Jordi Leno-Colorado; Mathieu Charles; Olivier Bouchez; Yoel Rodríguez-Valera; Miguel Pérez-Enciso; Yuliaxis Ramayo-Caldas
Journal:  Genet Sel Evol       Date:  2021-01-04       Impact factor: 4.297

4.  Sequence-based GWAS and post-GWAS analyses reveal a key role of SLC37A1, ANKH, and regulatory regions on bovine milk mineral content.

Authors:  Marie-Pierre Sanchez; Dominique Rocha; Mathieu Charles; Mekki Boussaha; Chris Hozé; Mickaël Brochard; Agnès Delacroix-Buchet; Philippe Grosperrin; Didier Boichard
Journal:  Sci Rep       Date:  2021-04-06       Impact factor: 4.379

5.  Plant-DTI: Extending the landscape of TF protein and DNA interaction in plants by a machine learning-based approach.

Authors:  Bhukrit Ruengsrichaiya; Chakarida Nukoolkit; Saowalak Kalapanulak; Treenut Saithong
Journal:  Front Plant Sci       Date:  2022-08-23       Impact factor: 6.627

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.