MOTIVATION: ChIPseq is rapidly becoming a common technique for investigating protein-DNA interactions. However, results from individual experiments provide a limited understanding of chromatin structure, as various chromatin factors cooperate in complex ways to orchestrate transcription. To quantify chromatin interactions, it is thus necessary to devise a robust similarity metric applicable to ChIPseq data. Unfortunately, moving past simple overlap calculations to statistically rigorous comparisons of ChIPseq datasets often involves arbitrary choices of distance metrics, with significance estimated by computationally intensive permutation tests whose statistical power may be sensitive to non-biological experimental and post-processing variation.

RESULTS: We show that it is in fact possible to compare ChIPseq datasets through the efficient computation of exact P-values for proximity. Our method is insensitive to non-biological variation in datasets, such as peak width, and can rigorously model peak location biases by evaluating similarity conditioned on a restricted set of genomic regions (such as the mappable genome or promoter regions). Applying our method to the well-studied dataset of Chen et al. (2008), we elucidate novel interactions that conform well with our biological understanding. By comparing ChIPseq data in an asymmetric way, we are able to observe clear interaction differences between cofactors such as p300 and factors that bind DNA directly.

AVAILABILITY: Source code is available for download at http://sonorus.princeton.edu/IntervalStats/IntervalStats.tar.gz.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
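To make the idea of an exact proximity P-value conditioned on a restricted domain concrete, the following is a minimal sketch and not the authors' implementation. It assumes a single chromosome with integer coordinates and half-open intervals, and it defines the P-value of one query peak as the fraction of all same-width placements inside the domain that lie at least as close to the nearest reference peak as the observed query does. The names `gap`, `nearest_distance`, and `proximity_p_value` are hypothetical, and the brute-force enumeration is for illustration only; the abstract states that the actual method is efficient, which this sketch is not.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # half-open [start, end), an assumption of this sketch


def gap(a: Interval, b: Interval) -> int:
    """Distance between two intervals; 0 if they overlap or touch."""
    return max(0, a[0] - b[1], b[0] - a[1])


def nearest_distance(q: Interval, refs: List[Interval]) -> int:
    """Distance from q to the closest reference interval."""
    return min(gap(q, r) for r in refs)


def proximity_p_value(query: Interval,
                      refs: List[Interval],
                      domain: List[Interval]) -> float:
    """Exact P-value by enumerating every placement of the query in the domain.

    A placement counts as a hit when it is at least as close to a reference
    interval as the observed query. The comparison is asymmetric: swapping
    the roles of query and reference generally gives a different value.
    """
    width = query[1] - query[0]
    observed = nearest_distance(query, refs)
    total = 0
    hits = 0
    for d_start, d_end in domain:
        # Only placements that fit entirely inside this domain region.
        for s in range(d_start, d_end - width + 1):
            total += 1
            if nearest_distance((s, s + width), refs) <= observed:
                hits += 1
    if total == 0:
        raise ValueError("query does not fit anywhere in the domain")
    return hits / total


# Toy data (made up for illustration): one reference peak, one query peak.
refs = [(50, 60)]
query = (62, 66)
p_whole = proximity_p_value(query, refs, [(0, 100)])       # 19 of 97 placements
p_restricted = proximity_p_value(query, refs, [(40, 70)])  # 19 of 27 placements
```

Restricting the domain raises the P-value in this toy case, because fewer of the allowed placements are far from the reference peak. This mirrors the abstract's point that conditioning on a set of regions accounts for peak location bias.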