Rare variant discovery and calling by sequencing pooled samples with overlaps.

Wenhui Wang, Xiaolin Yin, Yoon Soo Pyon, Matthew Hayes, Jing Li.

Abstract

MOTIVATION: For many complex traits/diseases, it is believed that rare variants account for some of the missing heritability that cannot be explained by common variants. Sequencing a large number of samples through DNA pooling is a cost-effective strategy to discover rare variants and to investigate their associations with phenotypes. Overlapping pool designs provide further benefit because such approaches can potentially identify variant carriers, which is important for downstream applications of association analysis of rare variants. However, existing algorithms for analysing sequence data from overlapping pools are limited.
RESULTS: We propose a complete data analysis framework for overlapping pool designs, with novelties in all three major steps: variant pool and variant locus identification, variant allele frequency estimation and variant sample decoding. The framework can be used in combination with any design matrix. We have investigated its performance based on two different overlapping designs and have compared it with three state-of-the-art methods, by simulating targeted sequencing and by pooling real sequence data. Results on both datasets show that our algorithm makes significant improvements over existing ones. In conclusion, successful discovery of rare variants and identification of variant carriers using overlapping pool strategies depend critically on many steps, from generation of design matrices to decoding algorithms. The proposed framework, in combination with design matrices generated based on the Chinese remainder theorem, achieves the best overall results.
AVAILABILITY: Source code of the program, termed VIP for Variant Identification by Pooling, is available at http://cbc.case.edu/VIP.
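
As an illustration of what an overlapping design matrix based on the Chinese remainder theorem can look like, the following is a minimal sketch and not the VIP implementation. It assumes the common construction in which each pairwise-coprime "window" size w contributes w pools and sample s is placed in pool (s mod w) of that window. The function names `crt_design_matrix` and `naive_decode`, the window sizes and the noise-free decoding rule are all hypothetical choices made for this example; the framework described in the abstract additionally handles sequencing error and allele frequency estimation, which this sketch ignores.

```python
from math import gcd


def crt_design_matrix(n_samples, windows):
    """Build a pools x samples 0/1 matrix from pairwise-coprime window sizes.

    Assumed construction: window size w contributes w pools, and sample s
    goes into pool (s mod w) of that window.
    """
    for i, a in enumerate(windows):
        for b in windows[i + 1:]:
            if gcd(a, b) != 1:
                raise ValueError("window sizes must be pairwise coprime")
    rows = []
    for w in windows:
        for r in range(w):
            rows.append([1 if s % w == r else 0 for s in range(n_samples)])
    return rows


def naive_decode(matrix, positive_pools):
    """Return samples whose pools are all positive.

    Illustrative, noise-free rule only; it can report non-carriers when
    several carriers share the same variant.
    """
    positive = set(positive_pools)
    n_samples = len(matrix[0])
    return [
        s
        for s in range(n_samples)
        if all(p in positive for p, row in enumerate(matrix) if row[s])
    ]


# Hypothetical example: 40 samples, windows 5, 7 and 8 (5 + 7 + 8 = 20 pools).
matrix = crt_design_matrix(40, (5, 7, 8))
# Pools that contain sample 17, i.e. the pools that would carry its variant.
carrier_pools = [p for p, row in enumerate(matrix) if row[17]]
decoded = naive_decode(matrix, carrier_pools)
```

With these assumed windows every sample is sequenced in exactly three pools, and because 5 x 7 x 8 = 280 exceeds the number of samples, a single carrier is identified uniquely from the set of pools in which the variant appears.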

Year:  2012        PMID: 23104896      PMCID: PMC3530907          DOI: 10.1093/bioinformatics/bts645

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


