Literature DB >> 19447964

Overlapping pools for high-throughput targeted resequencing.

Snehit Prabhu1, Itsik Pe'er.   

Abstract

Resequencing genomic DNA from pools of individuals is an effective strategy to detect new variants in targeted regions and compare them between cases and controls. There are numerous ways to assign individuals to the pools on which they are to be sequenced. The naïve, disjoint pooling scheme (many individuals to one pool) in predominant use today offers insight into allele frequencies, but does not offer the identity of an allele carrier. We present a framework for overlapping pool design, where each individual sample is resequenced in several pools (many individuals to many pools). Upon discovering a variant, the set of pools where this variant is observed reveals the identity of its carrier. We formalize the mathematical framework for such pool designs and list the requirements from such designs. We specifically address three practical concerns for pooled resequencing designs: (1) false-positives due to errors introduced during amplification and sequencing; (2) false-negatives due to undersampling particular alleles aggravated by nonuniform coverage; and consequently, (3) ambiguous identification of individual carriers in the presence of errors. We build on theory of error-correcting codes to design pools that overcome these pitfalls. We show that in practical parameters of resequencing studies, our designs guarantee high probability of unambiguous singleton carrier identification while maintaining the features of naïve pools in terms of sensitivity, specificity, and the ability to estimate allele frequencies. We demonstrate the ability of our designs in extracting rare variations using short read data from the 1000 Genomes Pilot 3 project.

Mesh:

Substances:

Year:  2009        PMID: 19447964      PMCID: PMC2704440          DOI: 10.1101/gr.088559.108

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  23 in total

1.  A clone-array pooled shotgun strategy for sequencing large genomes.

Authors:  W W Cai; R Chen; R A Gibbs; A Bradley
Journal:  Genome Res       Date:  2001-10       Impact factor: 9.043

2.  Quality and completeness of SNP databases.

Authors:  David E Reich; Stacey B Gabriel; David Altshuler
Journal:  Nat Genet       Date:  2003-03-24       Impact factor: 38.330

3.  The International HapMap Project.

Authors: 
Journal:  Nature       Date:  2003-12-18       Impact factor: 49.962

4.  Estimation of haplotype frequencies, linkage-disequilibrium measures, and combination of haplotype copies in each pool by use of pooled DNA data.

Authors:  Toshikazu Ito; Suenori Chiku; Eisuke Inoue; Makoto Tomita; Takayuki Morisaki; Hiroko Morisaki; Naoyuki Kamatani
Journal:  Am J Hum Genet       Date:  2003-01-17       Impact factor: 11.025

5.  Decoding randomly ordered DNA arrays.

Authors:  Kevin L Gunderson; Semyon Kruglyak; Michael S Graige; Francisco Garcia; Bahram G Kermani; Chanfeng Zhao; Diping Che; Todd Dickinson; Eliza Wickham; Jim Bierle; Dennis Doucet; Monika Milewski; Robert Yang; Chris Siegmund; Juergen Haas; Lixin Zhou; Arnold Oliphant; Jian-Bing Fan; Steven Barnard; Mark S Chee
Journal:  Genome Res       Date:  2004-04-12       Impact factor: 9.043

6.  Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Authors:  Heng Li; Jue Ruan; Richard Durbin
Journal:  Genome Res       Date:  2008-08-19       Impact factor: 9.043

7.  Allele frequency distributions in pooled DNA samples: applications to mapping complex disease genes.

Authors:  S H Shaw; M M Carrasquillo; C Kashuk; E G Puffenberger; A Chakravarti
Journal:  Genome Res       Date:  1998-02       Impact factor: 9.043

8.  Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome.

Authors:  D G Wang; J B Fan; C J Siao; A Berno; P Young; R Sapolsky; G Ghandour; N Perkins; E Winchester; J Spencer; L Kruglyak; L Stein; L Hsie; T Topaloglou; E Hubbell; E Robinson; M Mittmann; M S Morris; N Shen; D Kilburn; J Rioux; C Nusbaum; S Rozen; T J Hudson; R Lipshutz; M Chee; E S Lander
Journal:  Science       Date:  1998-05-15       Impact factor: 47.728

9.  Genomic mapping by fingerprinting random clones: a mathematical analysis.

Authors:  E S Lander; M S Waterman
Journal:  Genomics       Date:  1988-04       Impact factor: 5.736

Review 10.  Searching for genetic determinants in the new millennium.

Authors:  N J Risch
Journal:  Nature       Date:  2000-06-15       Impact factor: 49.962

View more
  38 in total

1.  Multi-sample pooling and illumina genome analyzer sequencing methods to determine gene sequence variation for database development.

Authors:  Rebecca L Margraf; Jacob D Durtschi; Shale Dames; David C Pattison; Jack E Stephens; Rong Mao; Karl V Voelkerding
Journal:  J Biomol Tech       Date:  2010-09

2.  High-throughput discovery of rare insertions and deletions in large cohorts.

Authors:  Francesco L M Vallania; Todd E Druley; Enrique Ramos; Jue Wang; Ingrid Borecki; Michael Province; Robi D Mitra
Journal:  Genome Res       Date:  2010-11-01       Impact factor: 9.043

3.  Large-scale mapping of transposable element insertion sites using digital encoding of sample identity.

Authors:  Daryl M Gohl; Limor Freifeld; Marion Silies; Jennifer J Hwa; Mark Horowitz; Thomas R Clandinin
Journal:  Genetics       Date:  2013-12-27       Impact factor: 4.562

4.  eALPS: estimating abundance levels in pooled sequencing using available genotyping data.

Authors:  Itamar Eskin; Farhad Hormozdiari; Lucia Conde; Jacques Riby; Christine F Skibola; Eleazar Eskin; Eran Halperin
Journal:  J Comput Biol       Date:  2013-10-21       Impact factor: 1.479

5.  Combinatorics and next-generation sequencing.

Authors:  Nick Patterson; Stacey Gabriel
Journal:  Nat Biotechnol       Date:  2009-09       Impact factor: 54.908

6.  Variant identification in multi-sample pools by illumina genome analyzer sequencing.

Authors:  Rebecca L Margraf; Jacob D Durtschi; Shale Dames; David C Pattison; Jack E Stephens; Karl V Voelkerding
Journal:  J Biomol Tech       Date:  2011-07

7.  Rare variant discovery and calling by sequencing pooled samples with overlaps.

Authors:  Wenhui Wang; Xiaolin Yin; Yoon Soo Pyon; Matthew Hayes; Jing Li
Journal:  Bioinformatics       Date:  2012-10-27       Impact factor: 6.937

Review 8.  Next-generation sequencing in aging research: emerging applications, problems, pitfalls and possible solutions.

Authors:  João Pedro de Magalhães; Caleb E Finch; Georges Janssens
Journal:  Ageing Res Rev       Date:  2009-11-10       Impact factor: 10.895

9.  A statistical method for the detection of variants from next-generation resequencing of DNA pools.

Authors:  Vikas Bansal
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

10.  Identification of rare alleles and their carriers using compressed se(que)nsing.

Authors:  Noam Shental; Amnon Amir; Or Zuk
Journal:  Nucleic Acids Res       Date:  2010-08-10       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.