Tissue-specific prediction of directly regulated genes.

Robert C McLeay, Chris J Leat, Timothy L Bailey.

Abstract

MOTIVATION: Direct binding by a transcription factor (TF) to the proximal promoter of a gene is strong evidence that the TF regulates the gene. Assaying the genome-wide binding of every TF in every cell type and condition is currently impractical. Histone modifications correlate with tissue/cell/condition-specific ('tissue-specific') TF binding, so histone ChIP-seq data can be combined with traditional position weight matrix (PWM) methods to make tissue-specific predictions of TF-promoter interactions.
RESULTS: We use supervised learning to train a naïve Bayes predictor of TF-promoter binding. The predictor's features are the histone modification levels and a PWM-based score for the promoter. Training and testing use sets of promoters labeled using TF ChIP-seq data, and we use cross-validation on 23 such datasets to measure accuracy. A PWM+histone naïve Bayes predictor using a single histone modification (H3K4me3) is substantially more accurate than a PWM score or a conservation-based score (phylogenetic motif model). The naïve Bayes predictor is more accurate (on average) at all sensitivity levels, and makes only half as many false positive predictions at sensitivity levels from 10% to 80%. On average, it correctly predicts 80% of bound promoters at a false positive rate of 20%. Accuracy does not diminish when we test the predictor in a different cell type (and species) from the one used for training. Accuracy is barely diminished even when we train the predictor without using TF ChIP-seq data.
AVAILABILITY: Our tissue-specific predictor of promoters bound by a TF is called Dr Gene and is available at http://bioinformatics.org.au/drgene.
CONTACT: t.bailey@imb.uq.edu.au
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
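
The idea of combining a PWM-based promoter score with a histone modification level in a naïve Bayes predictor can be illustrated with a minimal sketch. This is not the Dr Gene implementation: the 4-column PWM, the uniform background, the Gaussian likelihoods, and all training values below are hypothetical choices made only to keep the example self-contained and runnable.

```python
import math

# Hypothetical 4-column PWM favouring the motif "ACGT" (not from the paper).
PWM = [
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.1, "T": 0.7},
]
BACKGROUND = 0.25  # assumed uniform base frequency


def pwm_score(seq, pwm=PWM):
    """Best log-odds score (bits) of the PWM over all windows of seq."""
    width = len(pwm)
    best = -math.inf
    for start in range(len(seq) - width + 1):
        score = sum(
            math.log2(pwm[i][seq[start + i]] / BACKGROUND) for i in range(width)
        )
        best = max(best, score)
    return best


def fit_gaussian_nb(rows, labels):
    """Estimate a class prior and per-feature mean/variance for each class."""
    model = {}
    for cls in set(labels):
        members = [r for r, y in zip(rows, labels) if y == cls]
        stats = []
        for j in range(len(rows[0])):
            vals = [m[j] for m in members]
            mean = sum(vals) / len(vals)
            # Small constant keeps the variance strictly positive.
            var = sum((v - mean) ** 2 for v in vals) / len(vals) + 1e-6
            stats.append((mean, var))
        model[cls] = (len(members) / len(rows), stats)
    return model


def prob_bound(model, row):
    """Posterior probability of class 1 (bound), features assumed independent."""
    log_joint = {}
    for cls, (prior, stats) in model.items():
        lp = math.log(prior)
        for x, (mean, var) in zip(row, stats):
            lp += -0.5 * math.log(2 * math.pi * var) - (x - mean) ** 2 / (2 * var)
        log_joint[cls] = lp
    top = max(log_joint.values())  # subtract the max for numerical stability
    total = sum(math.exp(v - top) for v in log_joint.values())
    return math.exp(log_joint[1] - top) / total


# Toy training set: (PWM score, H3K4me3 level) per promoter. Values are invented
# for illustration; label 1 = bound, 0 = not bound.
train_rows = [
    (8.0, 5.0), (7.5, 4.0), (6.0, 6.0), (9.0, 5.5),
    (2.0, 0.5), (3.0, 1.0), (7.0, 0.2), (1.0, 0.8),
]
train_labels = [1, 1, 1, 1, 0, 0, 0, 0]
model = fit_gaussian_nb(train_rows, train_labels)

motif_hit = pwm_score("TTACGTTT")  # contains the hypothetical motif
marked = prob_bound(model, (motif_hit, 5.0))    # motif plus histone mark
unmarked = prob_bound(model, (motif_hit, 0.3))  # same motif, no histone mark
```

In this toy setting the same motif score yields a high posterior when the promoter carries the histone mark and a low one when it does not, which is the kind of tissue-specific discrimination a PWM score alone cannot provide.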

Year:  2011        PMID: 21724591      PMCID: PMC3157924          DOI: 10.1093/bioinformatics/btr399

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937

