| Literature DB >> 26599758 |
Elena Grassi1, Ettore Zapparoli1, Ivan Molineris1, Paolo Provero1,2.
Abstract
Transcription factors regulate gene expression by binding regulatory DNA. Understanding the rules governing such binding is an essential step in describing the network of regulatory interactions, and its pathological alterations. We show that describing regulatory regions in terms of their profile of total binding affinities for transcription factors leads to increased predictive power compared to methods based on the identification of discrete binding sites. This applies both to the prediction of transcription factor binding as revealed by ChIP-seq experiments and to the prediction of gene expression through RNA-seq. Further significant improvements in predictive power are obtained when regulatory regions are defined based on chromatin states inferred from histone modification data.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26599758 PMCID: PMC4658012 DOI: 10.1371/journal.pone.0143627
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1(A) Boxplot of the AUC of 188 PWMs using TBA and occupancy at increasing values of the score cutoff. The white bar shows the distribution of AUCs obtained with MotifLocator [25], using the sum of the scores of the sites above an 80% cutoff as sequence score. (B) Distribution of the difference in AUC between TBA and occupancy at cutoff 0.1 for the same PWMs. The P-value is obtained with a paired Wilcoxon test.
Fig 2(A) Fraction of gene expression variance explained by models based on TBA and on occupancy with 80% cutoff on the score in nine cell lines. (B) Same as a function of the cutoff for human embryonic stem cells.
Fig 3Fraction of gene expression variance explained by models based on chromatin HMM data for each cell line, chromatin HMM data for K562 cells, and TBA alone.