Literature DB >> 16794926

Occupancy modeling of coverage distribution for whole genome shotgun DNA sequencing.

Michael C Wendl1.   

Abstract

Expected-value models have long provided a rudimentary theoretical foundation for random DNA sequencing. Here, we are interested in improving characterization of genome coverage in terms of its underlying probability distributions. We find that the mathematical notion of occupancy serves as a good model for evolution of the coverage distribution function and reveals new insights related to sequence redundancy. Established concepts, such as "full shotgun depth," have been assumed invariant, but actually depend on project size and decrease over time. For most microbial projects, the full shotgun milestone should be revised downward by about 30%. Accordingly, many already-completed genomes appear to have been over-sequenced. Results also suggest that read lengths for emerging high-throughput sequencing methods must be increased substantially before they can be considered as possible successors to the standard Sanger method. In particular, gains in throughput and sequence depth cannot be made to compensate for diminished read length. Limits are well approximated by a simple logarithmic equation, which should be useful in estimating maximum coverage-based redundancy for future projects.

Entities:  

Mesh:

Year:  2006        PMID: 16794926     DOI: 10.1007/s11538-005-9021-4

Source DB:  PubMed          Journal:  Bull Math Biol        ISSN: 0092-8240            Impact factor:   1.758


  8 in total

1.  Occupancy modeling, maximum contig size probabilities and designing metagenomics experiments.

Authors:  Stephen A Stanhope
Journal:  PLoS One       Date:  2010-07-29       Impact factor: 3.240

2.  Coverage theories for metagenomic DNA sequencing based on a generalization of Stevens' theorem.

Authors:  Michael C Wendl; Karthik Kota; George M Weinstock; Makedonka Mitreva
Journal:  J Math Biol       Date:  2012-09-11       Impact factor: 2.259

3.  Marker density and read depth for genotyping populations using genotyping-by-sequencing.

Authors:  Timothy M Beissinger; Candice N Hirsch; Rajandeep S Sekhon; Jillian M Foerster; James M Johnson; German Muttoni; Brieanne Vaillancourt; C Robin Buell; Shawn M Kaeppler; Natalia de Leon
Journal:  Genetics       Date:  2013-02-14       Impact factor: 4.562

4.  Bounds on the distribution of the number of gaps when circles and lines are covered by fragments: theory and practical application to genomic and metagenomic projects.

Authors:  John Moriarty; Julian R Marchesi; Anthony Metcalfe
Journal:  BMC Bioinformatics       Date:  2007-03-02       Impact factor: 3.169

5.  Estimating DNA coverage and abundance in metagenomes using a gamma approximation.

Authors:  Sean D Hooper; Daniel Dalevi; Amrita Pati; Konstantinos Mavromatis; Natalia N Ivanova; Nikos C Kyrpides
Journal:  Bioinformatics       Date:  2009-12-14       Impact factor: 6.937

6.  Aspects of coverage in medical DNA sequencing.

Authors:  Michael C Wendl; Richard K Wilson
Journal:  BMC Bioinformatics       Date:  2008-05-16       Impact factor: 3.169

7.  Viral genome sequencing by random priming methods.

Authors:  Appolinaire Djikeng; Rebecca Halpin; Ryan Kuzmickas; Jay Depasse; Jeremy Feldblyum; Naomi Sengamalay; Claudio Afonso; Xinsheng Zhang; Norman G Anderson; Elodie Ghedin; David J Spiro
Journal:  BMC Genomics       Date:  2008-01-07       Impact factor: 3.969

8.  Increasing Genome Sampling and Improving SNP Genotyping for Genotyping-by-Sequencing with New Combinations of Restriction Enzymes.

Authors:  Yong-Bi Fu; Gregory W Peterson; Yibo Dong
Journal:  G3 (Bethesda)       Date:  2016-04-07       Impact factor: 3.154

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.