Wei Li1, Clifford A Meyer, X Shirley Liu. 1. Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Harvard School of Public Health Boston, MA 02115, USA.
Abstract
MOTIVATION: Transcription factors (TFs) regulate gene expression by recognizing and binding to specific regulatory regions on the genome, which in higher eukaryotes can occur far away from the regulated genes. Recently, Affymetrix developed the high-density oligonucleotide arrays that tile all the non-repetitive sequences of the human genome at 35 bp resolution. This new array platform allows for the unbiased mapping of in vivo TF binding sequences (TFBSs) using Chromatin ImmunoPrecipitation followed by microarray experiments (ChIP-chip). The massive dataset generated from these experiments pose great challenges for data analysis. RESULTS: We developed a fast, scalable and sensitive method to extract TFBSs from ChIP-chip experiments on genome tiling arrays. Our method takes advantage of tiling array data from many experiments to normalize and model the behavior of each individual probe, and identifies TFBSs using a hidden Markov model (HMM). When applied to the data of p53 ChIP-chip experiments from an earlier study, our method discovered many new high confidence p53 targets including all the regions verified by quantitative PCR. Using a de novo motif finding algorithm MDscan, we also recovered the p53 motif from our HMM identified p53 target regions. Furthermore, we found substantial p53 motif enrichment in these regions comparing with both genomic background and the TFBSs identified earlier. Several of the newly identified p53 TFBSs are in the promoter region of known genes or associated with previously characterized p53-responsive genes. SUPPLEMENTARY INFORMATION: Available at the following URL http://genome.dfci.harvard.edu/~xsliu/HMMTiling/index.html.
MOTIVATION: Transcription factors (TFs) regulate gene expression by recognizing and binding to specific regulatory regions on the genome, which in higher eukaryotes can occur far away from the regulated genes. Recently, Affymetrix developed the high-density oligonucleotide arrays that tile all the non-repetitive sequences of the human genome at 35 bp resolution. This new array platform allows for the unbiased mapping of in vivo TF binding sequences (TFBSs) using Chromatin ImmunoPrecipitation followed by microarray experiments (ChIP-chip). The massive dataset generated from these experiments pose great challenges for data analysis. RESULTS: We developed a fast, scalable and sensitive method to extract TFBSs from ChIP-chip experiments on genome tiling arrays. Our method takes advantage of tiling array data from many experiments to normalize and model the behavior of each individual probe, and identifies TFBSs using a hidden Markov model (HMM). When applied to the data of p53 ChIP-chip experiments from an earlier study, our method discovered many new high confidence p53 targets including all the regions verified by quantitative PCR. Using a de novo motif finding algorithm MDscan, we also recovered the p53 motif from our HMM identified p53 target regions. Furthermore, we found substantial p53 motif enrichment in these regions comparing with both genomic background and the TFBSs identified earlier. Several of the newly identified p53 TFBSs are in the promoter region of known genes or associated with previously characterized p53-responsive genes. SUPPLEMENTARY INFORMATION: Available at the following URL http://genome.dfci.harvard.edu/~xsliu/HMMTiling/index.html.
Authors: W Evan Johnson; Wei Li; Clifford A Meyer; Raphael Gottardo; Jason S Carroll; Myles Brown; X Shirley Liu Journal: Proc Natl Acad Sci U S A Date: 2006-08-08 Impact factor: 11.205
Authors: Olof Emanuelsson; Ugrappa Nagalakshmi; Deyou Zheng; Joel S Rozowsky; Alexander E Urban; Jiang Du; Zheng Lian; Viktor Stolc; Sherman Weissman; Michael Snyder; Mark B Gerstein Journal: Genome Res Date: 2006-11-21 Impact factor: 9.043
Authors: J D Lieb; S Beck; M L Bulyk; P Farnham; N Hattori; S Henikoff; X S Liu; K Okumura; K Shiota; T Ushijima; J M Greally Journal: Cytogenet Genome Res Date: 2006 Impact factor: 1.636
Authors: Pantelis Hatzis; Laurens G van der Flier; Marc A van Driel; Victor Guryev; Fiona Nielsen; Sergei Denissov; Isaäc J Nijman; Jan Koster; Evan E Santo; Willem Welboren; Rogier Versteeg; Edwin Cuppen; Marc van de Wetering; Hans Clevers; Hendrik G Stunnenberg Journal: Mol Cell Biol Date: 2008-02-11 Impact factor: 4.272
Authors: Adam A Margolin; Teresa Palomero; Pavel Sumazin; Andrea Califano; Adolfo A Ferrando; Gustavo Stolovitzky Journal: Proc Natl Acad Sci U S A Date: 2008-12-31 Impact factor: 11.205