Literature DB >> 23730349

On Estimation of Allele Frequencies via Next-Generation DNA Resequencing with Barcoding.

Joon Sang Lee1, Hongyu Zhao.   

Abstract

Next Generation Sequencing (NGS) has revolutionized biomedical research in recent years. It is now commonly used to identify rare variants through re-sequencing individual genomes. Due to the cost of NGS, researchers have considered pooling samples as a cost-effective alternative to individual sequencing. In this article, we consider the estimation of allele frequencies of rare variants through the NGS technologies with pooled DNA samples with or without barcodes. We consider three methods for estimating allele frequencies from such data, including raw sequencing counts, inferred genotypes, and expected minor allele counts and compare their performance. Our simulation results suggest that the estimator based on inferred genotypes overall performs better than or as well as the other two estimators. When the sequencing coverage is low, biases and MSEs can be sensitive to the choice of the prior probabilities of genotypes for the estimators based on inferred genotypes and expected minor allele counts so that more accurate specification of prior probabilities is critical to lower biases and MSEs. Our study shows that the optimal number of barcodes in a pool is relatively robust to the frequencies of rare variants at a specific coverage depth. We provide general guidelines on using DNA pooling with barcoding for the estimation of allele frequencies of rare variants.

Entities:  

Year:  2013        PMID: 23730349      PMCID: PMC3666873          DOI: 10.1007/s12561-013-9084-y

Source DB:  PubMed          Journal:  Stat Biosci        ISSN: 1867-1764


  18 in total

1.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

2.  Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Authors:  Heng Li; Jue Ruan; Richard Durbin
Journal:  Genome Res       Date:  2008-08-19       Impact factor: 9.043

3.  Estimation of allele frequencies from high-coverage genome-sequencing projects.

Authors:  Michael Lynch
Journal:  Genetics       Date:  2009-03-16       Impact factor: 4.562

Review 4.  Next-generation DNA sequencing methods.

Authors:  Elaine R Mardis
Journal:  Annu Rev Genomics Hum Genet       Date:  2008       Impact factor: 8.929

5.  K+ channel mutations in adrenal aldosterone-producing adenomas and hereditary hypertension.

Authors:  Murim Choi; Ute I Scholl; Peng Yue; Peyman Björklund; Bixiao Zhao; Carol Nelson-Williams; Weizhen Ji; Yoonsang Cho; Aniruddh Patel; Clara J Men; Elias Lolis; Max V Wisgerhof; David S Geller; Shrikant Mane; Per Hellman; Gunnar Westin; Göran Åkerström; Wenhui Wang; Tobias Carling; Richard P Lifton
Journal:  Science       Date:  2011-02-11       Impact factor: 47.728

6.  A statistical method for the detection of variants from next-generation resequencing of DNA pools.

Authors:  Vikas Bansal
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

7.  On optimal pooling designs to identify rare variants through massive resequencing.

Authors:  Joon Sang Lee; Murim Choi; Xiting Yan; Richard P Lifton; Hongyu Zhao
Journal:  Genet Epidemiol       Date:  2011-01-19       Impact factor: 2.135

8.  Estimation of allele frequency and association mapping using next-generation sequencing data.

Authors:  Su Yeon Kim; Kirk E Lohmueller; Anders Albrechtsen; Yingrui Li; Thorfinn Korneliussen; Geng Tian; Niels Grarup; Tao Jiang; Gitte Andersen; Daniel Witte; Torben Jorgensen; Torben Hansen; Oluf Pedersen; Jun Wang; Rasmus Nielsen
Journal:  BMC Bioinformatics       Date:  2011-06-11       Impact factor: 3.169

9.  Targeted high-throughput sequencing of tagged nucleic acid samples.

Authors:  Matthias Meyer; Udo Stenzel; Sean Myles; Kay Prüfer; Michael Hofreiter
Journal:  Nucleic Acids Res       Date:  2007-08-01       Impact factor: 16.971

10.  Exome sequencing in sporadic autism spectrum disorders identifies severe de novo mutations.

Authors:  Brian J O'Roak; Pelagia Deriziotis; Choli Lee; Laura Vives; Jerrod J Schwartz; Santhosh Girirajan; Emre Karakoc; Alexandra P Mackenzie; Sarah B Ng; Carl Baker; Mark J Rieder; Deborah A Nickerson; Raphael Bernier; Simon E Fisher; Jay Shendure; Evan E Eichler
Journal:  Nat Genet       Date:  2011-05-15       Impact factor: 38.330

View more
  1 in total

1.  Likelihood-based complex trait association testing for arbitrary depth sequencing data.

Authors:  Song Yan; Shuai Yuan; Zheng Xu; Baqun Zhang; Bo Zhang; Guolian Kang; Andrea Byrnes; Yun Li
Journal:  Bioinformatics       Date:  2015-05-14       Impact factor: 6.937

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.