Literature DB >> 24966364

CNV-guided multi-read allocation for ChIP-seq.

Qi Zhang1, Sündüz Keleş2.   

Abstract

MOTIVATION: In chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) and other short-read sequencing experiments, a considerable fraction of the short reads align to multiple locations on the reference genome (multi-reads). Inferring the origin of multi-reads is critical for accurately mapping reads to repetitive regions. Current state-of-the-art multi-read allocation algorithms rely on the read counts in the local neighborhood of the alignment locations and ignore the variation in the copy numbers of these regions. Copy-number variation (CNV) can directly affect the read densities and, therefore, bias allocation of multi-reads.
RESULTS: We propose cnvCSEM (CNV-guided ChIP-Seq by expectation-maximization algorithm), a flexible framework that incorporates CNV in multi-read allocation. cnvCSEM eliminates the CNV bias in multi-read allocation by initializing the read allocation algorithm with CNV-aware initial values. Our data-driven simulations illustrate that cnvCSEM leads to higher read coverage with satisfactory accuracy and lower loss in read-depth recovery (estimation). We evaluate the biological relevance of the cnvCSEM-allocated reads and the resultant peaks with the analysis of several ENCODE ChIP-seq datasets.
AVAILABILITY AND IMPLEMENTATION: Available at http://www.stat.wisc.edu/∼qizhang/ CONTACT: : qizhang@stat.wisc.edu or keles@stat.wisc.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Substances:

Year:  2014        PMID: 24966364      PMCID: PMC4184254          DOI: 10.1093/bioinformatics/btu402

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  21 in total

1.  Recent segmental duplications in the human genome.

Authors:  Jeffrey A Bailey; Zhiping Gu; Royden A Clark; Knut Reinert; Rhea V Samonte; Stuart Schwartz; Mark D Adams; Eugene W Myers; Peter W Li; Evan E Eichler
Journal:  Science       Date:  2002-08-09       Impact factor: 47.728

2.  CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing.

Authors:  Alexej Abyzov; Alexander E Urban; Michael Snyder; Mark Gerstein
Journal:  Genome Res       Date:  2011-02-07       Impact factor: 9.043

3.  DNase-seq: a high-resolution technique for mapping active gene regulatory elements across the genome from mammalian cells.

Authors:  Lingyun Song; Gregory E Crawford
Journal:  Cold Spring Harb Protoc       Date:  2010-02

4.  A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags.

Authors:  Jianrong Wang; Ahsan Huda; Victoria V Lunyak; I King Jordan
Journal:  Bioinformatics       Date:  2010-09-24       Impact factor: 6.937

5.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.

Authors:  Ben Langmead; Cole Trapnell; Mihai Pop; Steven L Salzberg
Journal:  Genome Biol       Date:  2009-03-04       Impact factor: 13.583

6.  HMCan: a method for detecting chromatin modifications in cancer samples using ChIP-seq data.

Authors:  Haitham Ashoor; Aurélie Hérault; Aurélie Kamoun; François Radvanyi; Vladimir B Bajic; Emmanuel Barillot; Valentina Boeva
Journal:  Bioinformatics       Date:  2013-09-09       Impact factor: 6.937

7.  MEME SUITE: tools for motif discovery and searching.

Authors:  Timothy L Bailey; Mikael Boden; Fabian A Buske; Martin Frith; Charles E Grant; Luca Clementi; Jingyuan Ren; Wilfred W Li; William S Noble
Journal:  Nucleic Acids Res       Date:  2009-05-20       Impact factor: 16.971

8.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

9.  Global mapping of protein-DNA interactions in vivo by digital genomic footprinting.

Authors:  Jay R Hesselberth; Xiaoyu Chen; Zhihong Zhang; Peter J Sabo; Richard Sandstrom; Alex P Reynolds; Robert E Thurman; Shane Neph; Michael S Kuehn; William S Noble; Stanley Fields; John A Stamatoyannopoulos
Journal:  Nat Methods       Date:  2009-03-22       Impact factor: 28.547

10.  Design and analysis of ChIP-seq experiments for DNA-binding proteins.

Authors:  Peter V Kharchenko; Michael Y Tolstorukov; Peter J Park
Journal:  Nat Biotechnol       Date:  2008-11-16       Impact factor: 54.908

View more
  4 in total

1.  Systematic evaluation of the impact of ChIP-seq read designs on genome coverage, peak identification, and allele-specific binding detection.

Authors:  Qi Zhang; Xin Zeng; Sam Younkin; Trupti Kawli; Michael P Snyder; Sündüz Keleş
Journal:  BMC Bioinformatics       Date:  2016-02-24       Impact factor: 3.169

Review 2.  Computational Oncology in the Multi-Omics Era: State of the Art.

Authors:  Guillermo de Anda-Jáuregui; Enrique Hernández-Lemus
Journal:  Front Oncol       Date:  2020-04-07       Impact factor: 6.244

3.  Generative modeling of multi-mapping reads with mHi-C advances analysis of Hi-C studies.

Authors:  Ye Zheng; Ferhat Ay; Sunduz Keles
Journal:  Elife       Date:  2019-01-31       Impact factor: 8.140

4.  Sequential regulatory activity prediction across chromosomes with convolutional neural networks.

Authors:  David R Kelley; Yakir A Reshef; Maxwell Bileschi; David Belanger; Cory Y McLean; Jasper Snoek
Journal:  Genome Res       Date:  2018-03-27       Impact factor: 9.043

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.