MOTIVATION: Increases in microarray feature density allow the construction of so-called tiling microarrays. These arrays, or sets of arrays, contain probes targeting regions of sequenced genomes at regular genomic intervals. The unbiased nature of this approach allows for the identification of novel transcribed sequences, the localization of transcription factor binding sites (ChIP-chip), and high resolution comparative genomic hybridization, among other uses. These applications are quickly growing in popularity as tiling microarrays become more affordable. To reach maximum utility, the tiling microarray platform needs be developed to the point that 1 nt resolutions are achieved and that we have confidence in individual measurements taken at this fine of resolution. Any biases in tiling array signals must be systematically removed to achieve this goal. RESULTS: Towards this end, we investigated the importance of probe sequence composition on the efficacy of tiling microarrays for identifying novel transcription and transcription factor binding sites. We found that intensities are highly sequence dependent and can greatly influence results. We developed three metrics for assessing this sequence dependence and use them in evaluating existing sequence-based normalizations from the tiling microarray literature. In addition, we applied three new techniques for addressing this problem; one method, adapted from similar work on GeneChip brand microarrays, is based on modeling array signal as a linear function of probe sequence, the second method extends this approach by iterative weighting and re-fitting of the model, and the third technique extrapolates the popular quantile normalization algorithm for between-array normalization to probe sequence space. These three methods perform favorably to existing strategies, based on the metrics defined here. AVAILABILITY: http://tiling.gersteinlab.org/sequence_effects/
MOTIVATION: Increases in microarray feature density allow the construction of so-called tiling microarrays. These arrays, or sets of arrays, contain probes targeting regions of sequenced genomes at regular genomic intervals. The unbiased nature of this approach allows for the identification of novel transcribed sequences, the localization of transcription factor binding sites (ChIP-chip), and high resolution comparative genomic hybridization, among other uses. These applications are quickly growing in popularity as tiling microarrays become more affordable. To reach maximum utility, the tiling microarray platform needs be developed to the point that 1 nt resolutions are achieved and that we have confidence in individual measurements taken at this fine of resolution. Any biases in tiling array signals must be systematically removed to achieve this goal. RESULTS: Towards this end, we investigated the importance of probe sequence composition on the efficacy of tiling microarrays for identifying novel transcription and transcription factor binding sites. We found that intensities are highly sequence dependent and can greatly influence results. We developed three metrics for assessing this sequence dependence and use them in evaluating existing sequence-based normalizations from the tiling microarray literature. In addition, we applied three new techniques for addressing this problem; one method, adapted from similar work on GeneChip brand microarrays, is based on modeling array signal as a linear function of probe sequence, the second method extends this approach by iterative weighting and re-fitting of the model, and the third technique extrapolates the popular quantile normalization algorithm for between-array normalization to probe sequence space. These three methods perform favorably to existing strategies, based on the metrics defined here. AVAILABILITY: http://tiling.gersteinlab.org/sequence_effects/
Authors: David S Johnson; Wei Li; D Benjamin Gordon; Arindam Bhattacharjee; Bo Curry; Jayati Ghosh; Leonardo Brizuela; Jason S Carroll; Myles Brown; Paul Flicek; Christoph M Koch; Ian Dunham; Mark Bieda; Xiaoqin Xu; Peggy J Farnham; Philipp Kapranov; David A Nix; Thomas R Gingeras; Xinmin Zhang; Heather Holster; Nan Jiang; Roland D Green; Jun S Song; Scott A McCuine; Elizabeth Anton; Loan Nguyen; Nathan D Trinklein; Zhen Ye; Keith Ching; David Hawkins; Bing Ren; Peter C Scacheri; Joel Rozowsky; Alexander Karpikov; Ghia Euskirchen; Sherman Weissman; Mark Gerstein; Michael Snyder; Annie Yang; Zarmik Moqtaderi; Heather Hirsch; Hennady P Shulha; Yutao Fu; Zhiping Weng; Kevin Struhl; Richard M Myers; Jason D Lieb; X Shirley Liu Journal: Genome Res Date: 2008-02-07 Impact factor: 9.043
Authors: Martijs J Jonker; Wim C de Leeuw; Marino Marinković; Floyd R A Wittink; Han Rauwerda; Oskar Bruning; Wim A Ensink; Ad C Fluit; C H Boel; Mark de Jong; Timo M Breit Journal: Nucleic Acids Res Date: 2014-04-25 Impact factor: 16.971
Authors: Gard O S Thomassen; Alexander D Rowe; Karin Lagesen; Jessica M Lindvall; Torbjørn Rognes Journal: PLoS One Date: 2009-06-17 Impact factor: 3.240
Authors: Roy R Chaudhuri; Sarah E Peters; Stephen J Pleasance; Helen Northen; Chrissie Willers; Gavin K Paterson; Danielle B Cone; Andrew G Allen; Paul J Owen; Gil Shalom; Dov J Stekel; Ian G Charles; Duncan J Maskell Journal: PLoS Pathog Date: 2009-07-31 Impact factor: 6.823