Literature DB >> 34661103

Characterizing batch effects and binding site-specific variability in ChIP-seq data.

Mingxiang Teng1, Dongliang Du1, Danfeng Chen2, Rafael A Irizarry3.   

Abstract

Multiple sources of variability can bias ChIP-seq data toward inferring transcription factor (TF) binding profiles. As ChIP-seq datasets increase in public repositories, it is now possible and necessary to account for complex sources of variability in ChIP-seq data analysis. We find that two types of variability, the batch effects by sequencing laboratories and differences between biological replicates, not associated with changes in condition or state, vary across genomic sites. This implies that observed differences between samples from different conditions or states, such as cell-type, must be assessed statistically, with an understanding of the distribution of obscuring noise. We present a statistical approach that characterizes both differences of interests and these source of variability through the parameters of a mixed effects model. We demonstrate the utility of our approach on a CTCF binding dataset composed of 211 samples representing 90 different cell-types measured across three different laboratories. The results revealed that sites exhibiting large variability were associated with sequence characteristics such as GC-content and low complexity. Finally, we identified TFs associated with high-variance CTCF sites using TF motifs documented in public databases, pointing the possibility of these being false positives if the sources of variability are not properly accounted for.
© The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.

Entities:  

Year:  2021        PMID: 34661103      PMCID: PMC8515842          DOI: 10.1093/nargab/lqab098

Source DB:  PubMed          Journal:  NAR Genom Bioinform        ISSN: 2631-9268


  47 in total

1.  Normalization of RNA-seq data using factor analysis of control genes or samples.

Authors:  Davide Risso; John Ngai; Terence P Speed; Sandrine Dudoit
Journal:  Nat Biotechnol       Date:  2014-08-24       Impact factor: 54.908

2.  Identification of a Ctcf cofactor, Yy1, for the X chromosome binary switch.

Authors:  Mary E Donohoe; Li-Feng Zhang; Na Xu; Yang Shi; Jeannie T Lee
Journal:  Mol Cell       Date:  2007-01-12       Impact factor: 17.970

3.  High-resolution profiling of histone methylations in the human genome.

Authors:  Artem Barski; Suresh Cuddapah; Kairong Cui; Tae-Young Roh; Dustin E Schones; Zhibin Wang; Gang Wei; Iouri Chepelev; Keji Zhao
Journal:  Cell       Date:  2007-05-18       Impact factor: 41.582

4.  Highly expressed loci are vulnerable to misleading ChIP localization of multiple unrelated proteins.

Authors:  Leonid Teytelman; Deborah M Thurtle; Jasper Rine; Alexander van Oudenaarden
Journal:  Proc Natl Acad Sci U S A       Date:  2013-10-30       Impact factor: 11.205

5.  De novo detection of differentially bound regions for ChIP-seq data using peaks and windows: controlling error rates correctly.

Authors:  Aaron T L Lun; Gordon K Smyth
Journal:  Nucleic Acids Res       Date:  2014-05-22       Impact factor: 16.971

6.  CTCF-dependent co-localization of canonical Smad signaling factors at architectural protein binding sites in D. melanogaster.

Authors:  Kevin Van Bortle; Aidan J Peterson; Naomi Takenaka; Michael B O'Connor; Victor G Corces
Journal:  Cell Cycle       Date:  2015       Impact factor: 4.534

7.  Detecting and correcting systematic variation in large-scale RNA sequencing data.

Authors:  Sheng Li; Paweł P Łabaj; Paul Zumbo; Peter Sykacek; Wei Shi; Leming Shi; John Phan; Po-Yen Wu; May Wang; Charles Wang; Danielle Thierry-Mieg; Jean Thierry-Mieg; David P Kreil; Christopher E Mason
Journal:  Nat Biotechnol       Date:  2014-08-24       Impact factor: 54.908

8.  JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework.

Authors:  Aziz Khan; Oriol Fornes; Arnaud Stigliani; Marius Gheorghe; Jaime A Castro-Mondragon; Robin van der Lee; Adrien Bessy; Jeanne Chèneby; Shubhada R Kulkarni; Ge Tan; Damir Baranasic; David J Arenillas; Albin Sandelin; Klaas Vandepoele; Boris Lenhard; Benoît Ballester; Wyeth W Wasserman; François Parcy; Anthony Mathelier
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

9.  Integrative analysis of 111 reference human epigenomes.

Authors:  Anshul Kundaje; Wouter Meuleman; Jason Ernst; Misha Bilenky; Angela Yen; Alireza Heravi-Moussavi; Pouya Kheradpour; Zhizhuo Zhang; Jianrong Wang; Michael J Ziller; Viren Amin; John W Whitaker; Matthew D Schultz; Lucas D Ward; Abhishek Sarkar; Gerald Quon; Richard S Sandstrom; Matthew L Eaton; Yi-Chieh Wu; Andreas R Pfenning; Xinchen Wang; Melina Claussnitzer; Yaping Liu; Cristian Coarfa; R Alan Harris; Noam Shoresh; Charles B Epstein; Elizabeta Gjoneska; Danny Leung; Wei Xie; R David Hawkins; Ryan Lister; Chibo Hong; Philippe Gascard; Andrew J Mungall; Richard Moore; Eric Chuah; Angela Tam; Theresa K Canfield; R Scott Hansen; Rajinder Kaul; Peter J Sabo; Mukul S Bansal; Annaick Carles; Jesse R Dixon; Kai-How Farh; Soheil Feizi; Rosa Karlic; Ah-Ram Kim; Ashwinikumar Kulkarni; Daofeng Li; Rebecca Lowdon; GiNell Elliott; Tim R Mercer; Shane J Neph; Vitor Onuchic; Paz Polak; Nisha Rajagopal; Pradipta Ray; Richard C Sallari; Kyle T Siebenthall; Nicholas A Sinnott-Armstrong; Michael Stevens; Robert E Thurman; Jie Wu; Bo Zhang; Xin Zhou; Arthur E Beaudet; Laurie A Boyer; Philip L De Jager; Peggy J Farnham; Susan J Fisher; David Haussler; Steven J M Jones; Wei Li; Marco A Marra; Michael T McManus; Shamil Sunyaev; James A Thomson; Thea D Tlsty; Li-Huei Tsai; Wei Wang; Robert A Waterland; Michael Q Zhang; Lisa H Chadwick; Bradley E Bernstein; Joseph F Costello; Joseph R Ecker; Martin Hirst; Alexander Meissner; Aleksandar Milosavljevic; Bing Ren; John A Stamatoyannopoulos; Ting Wang; Manolis Kellis
Journal:  Nature       Date:  2015-02-19       Impact factor: 69.504

10.  TFBSTools: an R/bioconductor package for transcription factor binding site analysis.

Authors:  Ge Tan; Boris Lenhard
Journal:  Bioinformatics       Date:  2016-01-21       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.