Literature DB >> 22152084

Consistency-based detection of potential tumor-specific deletions in matched normal/tumor genomes.

Roland Wittler1, Cedric Chauve.   

Abstract

BACKGROUND: Structural variations in human genomes, such as insertions, deletion, or rearrangements, play an important role in cancer development. Next-Generation Sequencing technologies have been central in providing ways to detect such variations. Most existing methods however are limited to the analysis of a single genome, and it is only recently that the comparison of closely related genomes has been considered. In particular, a few recent works considered the analysis of data sets obtained by sequencing both tumor and healthy tissues of the same cancer patient. In that context, the goal is to detect variations that are specific to exactly one of the genomes, for example to differentiate between patient-specific and tumor-specific variations. This is a difficult task, especially when facing the additional challenge of the possible contamination of healthy tissues by tumor cells and conversely.
RESULTS: In the current work, we analyzed a data set of paired-end short-reads, obtained by sequencing tumor tissues and healthy tissues, both from the same cancer patient. Based on a combinatorial notion of conflict between deletions, we show that in the tumor data, more deletions are predicted than there could actually be in a diploid genome. In contrast, the predictions for the data from normal tissues are almost conflict-free. We designed and applied a method, specific to the analysis of such pooled and contaminated data sets, to detect potential tumor-specific deletions. Our method takes the deletion calls from both data sets and assigns reads from the mixed tumor/normal data to the normal one with the goal to minimize the number of reads that need to be discarded to obtain a set of conflict-free deletion clusters. We observed that, on the specific data set we analyze, only a very small fraction of the reads needs to be discarded to obtain a set of consistent deletions.
CONCLUSIONS: We present a framework based on a rigorous definition of consistency between deletions and the assumption that the tumor sample also contains normal cells. A combined analysis of both data sets based on this model allowed a consistent explanation of almost all data, providing a detailed picture of candidate patient- and tumor-specific deletions.

Entities:  

Mesh:

Year:  2011        PMID: 22152084      PMCID: PMC3283309          DOI: 10.1186/1471-2105-12-S9-S21

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  32 in total

1.  End-sequence profiling: sequence-based analysis of aberrant genomes.

Authors:  Stanislav Volik; Shaying Zhao; Koei Chin; John H Brebner; David R Herndon; Quanzhou Tao; David Kowbel; Guiqing Huang; Anna Lapuk; Wen-Lin Kuo; Gregg Magrane; Pieter De Jong; Joe W Gray; Colin Collins
Journal:  Proc Natl Acad Sci U S A       Date:  2003-06-04       Impact factor: 11.205

2.  Fine-scale structural variation of the human genome.

Authors:  Eray Tuzun; Andrew J Sharp; Jeffrey A Bailey; Rajinder Kaul; V Anne Morrison; Lisa M Pertz; Eric Haugen; Hillary Hayden; Donna Albertson; Daniel Pinkel; Maynard V Olson; Evan E Eichler
Journal:  Nat Genet       Date:  2005-05-15       Impact factor: 38.330

Review 3.  Loss of constitutional heterozygosity in human cancer.

Authors:  D Lasko; W Cavenee; M Nordenskjöld
Journal:  Annu Rev Genet       Date:  1991       Impact factor: 16.830

4.  MoDIL: detecting small indels from clone-end sequencing with mixtures of distributions.

Authors:  Seunghak Lee; Fereydoun Hormozdiari; Can Alkan; Michael Brudno
Journal:  Nat Methods       Date:  2009-05-31       Impact factor: 28.547

5.  Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Authors:  Heng Li; Jue Ruan; Richard Durbin
Journal:  Genome Res       Date:  2008-08-19       Impact factor: 9.043

6.  A unified approach for reconstructing ancient gene clusters.

Authors:  Jens Stoye; Roland Wittler
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2009 Jul-Sep       Impact factor: 3.710

7.  Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads.

Authors:  Kai Ye; Marcel H Schulz; Quan Long; Rolf Apweiler; Zemin Ning
Journal:  Bioinformatics       Date:  2009-06-26       Impact factor: 6.937

8.  A geometric approach for classification and comparison of structural variants.

Authors:  Suzanne Sindi; Elena Helman; Ali Bashir; Benjamin J Raphael
Journal:  Bioinformatics       Date:  2009-06-15       Impact factor: 6.937

9.  Paired-end mapping reveals extensive structural variation in the human genome.

Authors:  Jan O Korbel; Alexander Eckehart Urban; Jason P Affourtit; Brian Godwin; Fabian Grubert; Jan Fredrik Simons; Philip M Kim; Dean Palejev; Nicholas J Carriero; Lei Du; Bruce E Taillon; Zhoutao Chen; Andrea Tanzer; A C Eugenia Saunders; Jianxiang Chi; Fengtang Yang; Nigel P Carter; Matthew E Hurles; Sherman M Weissman; Timothy T Harkins; Mark B Gerstein; Michael Egholm; Michael Snyder
Journal:  Science       Date:  2007-09-27       Impact factor: 47.728

10.  DNA sequencing of a cytogenetically normal acute myeloid leukaemia genome.

Authors:  Timothy J Ley; Elaine R Mardis; Li Ding; Bob Fulton; Michael D McLellan; Ken Chen; David Dooling; Brian H Dunford-Shore; Sean McGrath; Matthew Hickenbotham; Lisa Cook; Rachel Abbott; David E Larson; Dan C Koboldt; Craig Pohl; Scott Smith; Amy Hawkins; Scott Abbott; Devin Locke; Ladeana W Hillier; Tracie Miner; Lucinda Fulton; Vincent Magrini; Todd Wylie; Jarret Glasscock; Joshua Conyers; Nathan Sander; Xiaoqi Shi; John R Osborne; Patrick Minx; David Gordon; Asif Chinwalla; Yu Zhao; Rhonda E Ries; Jacqueline E Payton; Peter Westervelt; Michael H Tomasson; Mark Watson; Jack Baty; Jennifer Ivanovich; Sharon Heath; William D Shannon; Rakesh Nagarajan; Matthew J Walter; Daniel C Link; Timothy A Graubert; John F DiPersio; Richard K Wilson
Journal:  Nature       Date:  2008-11-06       Impact factor: 49.962

View more
  2 in total

1.  An integrative probabilistic model for identification of structural variation in sequencing data.

Authors:  Suzanne S Sindi; Selim Onal; Luke C Peng; Hsin-Ta Wu; Benjamin J Raphael
Journal:  Genome Biol       Date:  2012       Impact factor: 17.906

2.  Unraveling overlapping deletions by agglomerative clustering.

Authors:  Roland Wittler
Journal:  BMC Genomics       Date:  2013-01-21       Impact factor: 3.969

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.