Rare variant discovery and calling by sequencing pooled samples with overlaps.

Wenhui Wang, Xiaolin Yin, Yoon Soo Pyon, Matthew Hayes, Jing Li.

Abstract

MOTIVATION: For many complex traits/diseases, it is believed that rare variants account for some of the missing heritability that cannot be explained by common variants. Sequencing a large number of samples through DNA pooling is a cost-effective strategy to discover rare variants and to investigate their associations with phenotypes. Overlapping pool designs provide further benefit because such approaches can potentially identify variant carriers, which is important for downstream applications of association analysis of rare variants. However, existing algorithms for analysing sequence data from overlapping pools are limited.
RESULTS: We propose a complete data analysis framework for overlapping pool designs, with novelties in all three major steps: variant pool and variant locus identification, variant allele frequency estimation and variant sample decoding. The framework can be used in combination with any design matrix. We have investigated its performance based on two different overlapping designs and have compared it with three state-of-the-art methods, by simulating targeted sequencing and by pooling real sequence data. Results on both datasets show that our algorithm improves significantly on existing ones. In conclusion, successful discovery of rare variants and identification of variant carriers using overlapping pool strategies critically depend on many steps, from generation of design matrices to decoding algorithms. The proposed framework, in combination with design matrices generated based on the Chinese remainder theorem, achieves the best overall results.
AVAILABILITY: Source code of the program, termed VIP for Variant Identification by Pooling, is available at http://cbc.case.edu/VIP.
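
To make the idea of an overlapping design matrix concrete, the following is a minimal sketch of a Chinese-remainder-style pooling design with a naive, error-free decoding step. It is not the VIP algorithm, and it ignores sequencing error and allele frequency estimation, both of which the framework above addresses. The function names (crt_design, positive_pools, naive_decode), the moduli and the sample count are all hypothetical choices made for illustration.

```python
from itertools import combinations
from math import gcd


def crt_design(n_samples, moduli):
    """Build pools from pairwise coprime moduli (illustrative only).

    For each modulus m, sample s is placed in the pool with residue s % m,
    so every sample appears in exactly len(moduli) pools.
    """
    if any(gcd(a, b) != 1 for a, b in combinations(moduli, 2)):
        raise ValueError("moduli must be pairwise coprime")
    pools = []
    for m in moduli:
        for residue in range(m):
            pools.append({s for s in range(n_samples) if s % m == residue})
    return pools


def positive_pools(pools, carriers):
    """Idealised readout: a pool is positive if it contains any carrier."""
    return [bool(pool & carriers) for pool in pools]


def naive_decode(pools, positives, n_samples):
    """Report samples whose pools are all positive (assumes no errors)."""
    candidates = []
    for s in range(n_samples):
        flags = [positives[i] for i, pool in enumerate(pools) if s in pool]
        if all(flags):
            candidates.append(s)
    return candidates


# Hypothetical example: 20 samples, moduli 3, 4 and 5 give 3 + 4 + 5 = 12 pools.
pools = crt_design(20, (3, 4, 5))
observed = positive_pools(pools, {7})
print(naive_decode(pools, observed, 20))  # prints [7]
```

Because the product of the moduli (60) exceeds the number of samples, a single carrier is identified uniquely in this error-free setting. With several carriers or noisy pool calls, the naive intersection can return extra candidates, which is one reason a dedicated decoding algorithm is needed.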

Year: 2012 PMID: 23104896 PMCID: PMC3530907 DOI: 10.1093/bioinformatics/bts645

Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
