Literature DB >> 22533622

A generalized linear model for peak calling in ChIP-Seq data.

Jialin Xu1, Yu Zhang.   

Abstract

Chromatin immunoprecipitation followed by massively parallel sequencing (ChIP-Seq) has become a routine for detecting genome-wide protein-DNA interaction. The success of ChIP-Seq data analysis highly depends on the quality of peak calling (i.e., to detect peaks of tag counts at a genomic location and evaluate if the peak corresponds to a real protein-DNA interaction event). The challenges in peak calling include (1) how to combine the forward and the reverse strand tag data to improve the power of peak calling and (2) how to account for the variation of tag data observed across different genomic locations. We introduce a new peak calling method based on the generalized linear model (GLMNB) that utilizes negative binomial distribution to model the tag count data and account for the variation of background tags that may randomly bind to the DNA sequence at varying levels due to local genomic structures and sequence contents. We allow local shifting of peaks observed on the forward and the reverse stands, such that at each potential binding site, a binding profile representing the pattern of a real peak signal is fitted to best explain the observed tag data with maximum likelihood. Our method can also detect multiple peaks within a local region if there are multiple binding sites in the region.

Mesh:

Substances:

Year:  2012        PMID: 22533622      PMCID: PMC3375645          DOI: 10.1089/cmb.2012.0023

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  10 in total

Review 1.  Computation for ChIP-seq and RNA-seq studies.

Authors:  Shirley Pepke; Barbara Wold; Ali Mortazavi
Journal:  Nat Methods       Date:  2009-11       Impact factor: 28.547

2.  Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities.

Authors:  Sven Heinz; Christopher Benner; Nathanael Spann; Eric Bertolino; Yin C Lin; Peter Laslo; Jason X Cheng; Cornelis Murre; Harinder Singh; Christopher K Glass
Journal:  Mol Cell       Date:  2010-05-28       Impact factor: 17.970

3.  Insulin-induced phosphorylation of FKHR (Foxo1) targets to proteasomal degradation.

Authors:  Hitomi Matsuzaki; Hiroaki Daitoku; Mitsutoki Hatta; Keiji Tanaka; Akiyoshi Fukamizu
Journal:  Proc Natl Acad Sci U S A       Date:  2003-09-17       Impact factor: 11.205

4.  Evaluation of algorithm performance in ChIP-seq peak detection.

Authors:  Elizabeth G Wilbanks; Marc T Facciotti
Journal:  PLoS One       Date:  2010-07-08       Impact factor: 3.240

5.  Differential expression analysis for sequence count data.

Authors:  Simon Anders; Wolfgang Huber
Journal:  Genome Biol       Date:  2010-10-27       Impact factor: 13.583

6.  BayesPeak: Bayesian analysis of ChIP-seq data.

Authors:  Christiana Spyrou; Rory Stark; Andy G Lynch; Simon Tavaré
Journal:  BMC Bioinformatics       Date:  2009-09-21       Impact factor: 3.169

7.  Genome-wide analysis of transcription factor binding sites based on ChIP-Seq data.

Authors:  Anton Valouev; David S Johnson; Andreas Sundquist; Catherine Medina; Elizabeth Anton; Serafim Batzoglou; Richard M Myers; Arend Sidow
Journal:  Nat Methods       Date:  2008-09       Impact factor: 28.547

8.  An integrated software system for analyzing ChIP-chip and ChIP-seq data.

Authors:  Hongkai Ji; Hui Jiang; Wenxiu Ma; David S Johnson; Richard M Myers; Wing H Wong
Journal:  Nat Biotechnol       Date:  2008-11-02       Impact factor: 54.908

9.  Model-based analysis of ChIP-Seq (MACS).

Authors:  Yong Zhang; Tao Liu; Clifford A Meyer; Jérôme Eeckhoute; David S Johnson; Bradley E Bernstein; Chad Nusbaum; Richard M Myers; Myles Brown; Wei Li; X Shirley Liu
Journal:  Genome Biol       Date:  2008-09-17       Impact factor: 13.583

10.  Design and analysis of ChIP-seq experiments for DNA-binding proteins.

Authors:  Peter V Kharchenko; Michael Y Tolstorukov; Peter J Park
Journal:  Nat Biotechnol       Date:  2008-11-16       Impact factor: 54.908

  10 in total
  4 in total

1.  A randomized approach to speed up the analysis of large-scale read-count data in the application of CNV detection.

Authors:  WeiBo Wang; Wei Sun; Wei Wang; Jin Szatkiewicz
Journal:  BMC Bioinformatics       Date:  2018-03-01       Impact factor: 3.169

Review 2.  A survey of motif finding Web tools for detecting binding site motifs in ChIP-Seq data.

Authors:  Ngoc Tam L Tran; Chun-Hsi Huang
Journal:  Biol Direct       Date:  2014-02-20       Impact factor: 4.540

3.  Argonaute CLIP-Seq reveals miRNA targetome diversity across tissue types.

Authors:  Peter M Clark; Phillipe Loher; Kevin Quann; Jonathan Brody; Eric R Londin; Isidore Rigoutsos
Journal:  Sci Rep       Date:  2014-08-08       Impact factor: 4.379

4.  NGS-Integrator: An efficient tool for combining multiple NGS data tracks using minimum Bayes' factors.

Authors:  Bronte Wen; Hyun Jun Jung; Lihe Chen; Fahad Saeed; Mark A Knepper
Journal:  BMC Genomics       Date:  2020-11-19       Impact factor: 3.969

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.