Literature DB >> 20833816

A binary search approach to whole-genome data analysis.

Leonid Brodsky1, Simon Kogan, Eshel Benjacob, Eviatar Nevo.   

Abstract

A sequence analysis-oriented binary search-like algorithm was transformed to a sensitive and accurate analysis tool for processing whole-genome data. The advantage of the algorithm over previous methods is its ability to detect the margins of both short and long genome fragments, enriched by up-regulated signals, at equal accuracy. The score of an enriched genome fragment reflects the difference between the actual concentration of up-regulated signals in the fragment and the chromosome signal baseline. The "divide-and-conquer"-type algorithm detects a series of nonintersecting fragments of various lengths with locally optimal scores. The procedure is applied to detected fragments in a nested manner by recalculating the lower-than-baseline signals in the chromosome. The algorithm was applied to simulated whole-genome data, and its sensitivity/specificity were compared with those of several alternative algorithms. The algorithm was also tested with four biological tiling array datasets comprising Arabidopsis (i) expression and (ii) histone 3 lysine 27 trimethylation CHIP-on-chip datasets; Saccharomyces cerevisiae (iii) spliced intron data and (iv) chromatin remodeling factor binding sites. The analyses' results demonstrate the power of the algorithm in identifying both the short up-regulated fragments (such as exons and transcription factor binding sites) and the long--even moderately up-regulated zones--at their precise genome margins. The algorithm generates an accurate whole-genome landscape that could be used for cross-comparison of signals across the same genome in evolutionary and general genomic studies.

Entities:  

Mesh:

Year:  2010        PMID: 20833816      PMCID: PMC2947915          DOI: 10.1073/pnas.1011134107

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  12 in total

1.  Applications of recursive segmentation to the analysis of DNA sequences.

Authors:  Wentian Li; Pedro Bernaola-Galván; Fatameh Haghighi; Ivo Grosse
Journal:  Comput Chem       Date:  2002-07

2.  TileMap: create chromosomal map of tiling array hybridizations.

Authors:  Hongkai Ji; Wing Hung Wong
Journal:  Bioinformatics       Date:  2005-07-26       Impact factor: 6.937

3.  Genome sequencing in microfabricated high-density picolitre reactors.

Authors:  Marcel Margulies; Michael Egholm; William E Altman; Said Attiya; Joel S Bader; Lisa A Bemben; Jan Berka; Michael S Braverman; Yi-Ju Chen; Zhoutao Chen; Scott B Dewell; Lei Du; Joseph M Fierro; Xavier V Gomes; Brian C Godwin; Wen He; Scott Helgesen; Chun Heen Ho; Chun He Ho; Gerard P Irzyk; Szilveszter C Jando; Maria L I Alenquer; Thomas P Jarvie; Kshama B Jirage; Jong-Bum Kim; James R Knight; Janna R Lanza; John H Leamon; Steven M Lefkowitz; Ming Lei; Jing Li; Kenton L Lohman; Hong Lu; Vinod B Makhijani; Keith E McDade; Michael P McKenna; Eugene W Myers; Elizabeth Nickerson; John R Nobile; Ramona Plant; Bernard P Puc; Michael T Ronan; George T Roth; Gary J Sarkis; Jan Fredrik Simons; John W Simpson; Maithreyan Srinivasan; Karrie R Tartaro; Alexander Tomasz; Kari A Vogt; Greg A Volkmer; Shally H Wang; Yong Wang; Michael P Weiner; Pengguang Yu; Richard F Begley; Jonathan M Rothberg
Journal:  Nature       Date:  2005-07-31       Impact factor: 49.962

4.  Model-based analysis of tiling-arrays for ChIP-chip.

Authors:  W Evan Johnson; Wei Li; Clifford A Meyer; Raphael Gottardo; Jason S Carroll; Myles Brown; X Shirley Liu
Journal:  Proc Natl Acad Sci U S A       Date:  2006-08-08       Impact factor: 11.205

5.  ChIP-seq: welcome to the new frontier.

Authors:  Elaine R Mardis
Journal:  Nat Methods       Date:  2007-08       Impact factor: 28.547

6.  Construction of the full local similarity map for two biopolymers.

Authors:  A M Leontovich; L I Brodsky; A E Gorbalenya
Journal:  Biosystems       Date:  1993       Impact factor: 1.973

7.  Genome-wide identification of spliced introns using a tiling microarray.

Authors:  Zhihong Zhang; Jay R Hesselberth; Stanley Fields
Journal:  Genome Res       Date:  2007-03-09       Impact factor: 9.043

8.  A large number of novel coding small open reading frames in the intergenic regions of the Arabidopsis thaliana genome are transcribed and/or under purifying selection.

Authors:  Kousuke Hanada; Xu Zhang; Justin O Borevitz; Wen-Hsiung Li; Shin-Han Shiu
Journal:  Genome Res       Date:  2007-03-29       Impact factor: 9.043

9.  Whole-genome analysis of histone H3 lysine 27 trimethylation in Arabidopsis.

Authors:  Xiaoyu Zhang; Oliver Clarenz; Shawn Cokus; Yana V Bernatavichute; Matteo Pellegrini; Justin Goodrich; Steven E Jacobsen
Journal:  PLoS Biol       Date:  2007-05       Impact factor: 8.029

10.  Model-based analysis of ChIP-Seq (MACS).

Authors:  Yong Zhang; Tao Liu; Clifford A Meyer; Jérôme Eeckhoute; David S Johnson; Bradley E Bernstein; Chad Nusbaum; Richard M Myers; Myles Brown; Wei Li; X Shirley Liu
Journal:  Genome Biol       Date:  2008-09-17       Impact factor: 13.583

View more
  3 in total

1.  "Evolution Canyon," a potential microscale monitor of global warming across life.

Authors:  Eviatar Nevo
Journal:  Proc Natl Acad Sci U S A       Date:  2012-02-03       Impact factor: 11.205

2.  Evolution under environmental stress at macro- and microscales.

Authors:  Eviatar Nevo
Journal:  Genome Biol Evol       Date:  2011       Impact factor: 3.416

3.  An Association Study of DNA Methylation and Gene Expression in Angelman Syndrome: A Bioinformatics Approach.

Authors:  Julia Panov; Hanoch Kaphzan
Journal:  Int J Mol Sci       Date:  2022-08-15       Impact factor: 6.208

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.