Literature DB >> 21385042

The poisson margin test for normalization-free significance analysis of NGS data.

Adam Kowalczyk1, Justin Bedo, Thomas Conway, Bryan Beresford-Smith.   

Abstract

The current methods for the determination of the statistical significance of peaks and regions in next generation sequencing (NGS) data require an explicit normalization step to compensate for (global or local) imbalances in the sizes of sequenced and mapped libraries. There are no canonical methods for performing such compensations; hence, a number of different procedures serving this goal in different ways can be found in the literature. Unfortunately, the normalization has a significant impact on the final results. Different methods yield very different numbers of detected "significant peaks" even in the simplest scenario of ChIP-Seq experiments that compare the enrichment in a single sample relative to a matching control. This becomes an even more acute issue in the more general case of the comparison of multiple samples, where a number of arbitrary design choices will be required in the data analysis stage, each option resulting in possibly (significantly) different outcomes. In this article, we investigate a principled statistical procedure that eliminates the need for a normalization step. We outline its basic properties, in particular the scaling upon depth of sequencing. For the sake of illustration and comparison, we report the results of re-analyzing a ChIP-Seq experiment for transcription factor binding site detection. In order to quantify the differences between outcomes, we use a novel method based on the accuracy of in silico prediction by support vector machine (SVM) models trained on part of the genome and tested on the remainder. See Kowalczyk et al. ( 2009 ) for supplementary material.

Mesh:

Year:  2011        PMID: 21385042     DOI: 10.1089/cmb.2010.0272

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  3 in total

1.  Genome-wide analysis distinguishes hyperglycemia regulated epigenetic signatures of primary vascular cells.

Authors:  Luciano Pirola; Aneta Balcerczyk; Richard W Tothill; Izhak Haviv; Antony Kaspi; Sebastian Lunke; Mark Ziemann; Tom Karagiannis; Stephen Tonna; Adam Kowalczyk; Bryan Beresford-Smith; Geoff Macintyre; Ma Kelong; Zhang Hongyu; Jingde Zhu; Assam El-Osta
Journal:  Genome Res       Date:  2011-09-02       Impact factor: 9.043

2.  Molecular profiling of human mammary gland links breast cancer risk to a p27(+) cell population with progenitor characteristics.

Authors:  Sibgat Choudhury; Vanessa Almendro; Vanessa F Merino; Zhenhua Wu; Reo Maruyama; Ying Su; Filipe C Martins; Mary Jo Fackler; Marina Bessarabova; Adam Kowalczyk; Thomas Conway; Bryan Beresford-Smith; Geoff Macintyre; Yu-Kang Cheng; Zoila Lopez-Bujanda; Antony Kaspi; Rong Hu; Judith Robens; Tatiana Nikolskaya; Vilde D Haakensen; Stuart J Schnitt; Pedram Argani; Gabrielle Ethington; Laura Panos; Michael Grant; Jason Clark; William Herlihy; S Joyce Lin; Grace Chew; Erik W Thompson; April Greene-Colozzi; Andrea L Richardson; Gedge D Rosson; Malcolm Pike; Judy E Garber; Yuri Nikolsky; Joanne L Blum; Alfred Au; E Shelley Hwang; Rulla M Tamimi; Franziska Michor; Izhak Haviv; X Shirley Liu; Saraswati Sukumar; Kornelia Polyak
Journal:  Cell Stem Cell       Date:  2013-06-13       Impact factor: 24.633

3.  RAR/RXR binding dynamics distinguish pluripotency from differentiation associated cis-regulatory elements.

Authors:  Amandine Chatagnon; Philippe Veber; Valérie Morin; Justin Bedo; Gérard Triqueneaux; Marie Sémon; Vincent Laudet; Florence d'Alché-Buc; Gérard Benoit
Journal:  Nucleic Acids Res       Date:  2015-04-20       Impact factor: 16.971

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.