
HACSim: an R package to estimate intraspecific sample sizes for genetic diversity assessment using haplotype accumulation curves.

Jarrett D Phillips, Steven H French, Robert H Hanner, Daniel J Gillis.

Abstract

Assessing levels of standing genetic variation within species requires a robust sampling for the purpose of accurate specimen identification using molecular techniques such as DNA barcoding; however, statistical estimators for what constitutes a robust sample are currently lacking. Moreover, such estimates are needed because most species are currently represented by only one or a few sequences in existing databases, which can safely be assumed to be undersampled. Unfortunately, sample sizes of 5-10 specimens per species typically seen in DNA barcoding studies are often insufficient to adequately capture within-species genetic diversity. Here, we introduce a novel iterative extrapolation simulation algorithm of haplotype accumulation curves, called HACSim (Haplotype Accumulation Curve Simulator) that can be employed to calculate likely sample sizes needed to observe the full range of DNA barcode haplotype variation that exists for a species. Using uniform haplotype and non-uniform haplotype frequency distributions, the notion of sampling sufficiency (the sample size at which sampling accuracy is maximized and above which no new sampling information is likely to be gained) can be gleaned. HACSim can be employed in two primary ways to estimate specimen sample sizes: (1) to simulate haplotype sampling in hypothetical species, and (2) to simulate haplotype sampling in real species mined from public reference sequence databases like the Barcode of Life Data Systems (BOLD) or GenBank for any genomic marker of interest. While our algorithm is globally convergent, runtime is heavily dependent on initial sample sizes and skewness of the corresponding haplotype frequency distribution. ©2020 Phillips et al.
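The iterative idea described in the abstract can be illustrated with a minimal Python sketch. This is not the HACSim package (which is written in R) and should not be read as the authors' exact algorithm. It assumes a known number of haplotypes with a given frequency distribution, estimates the mean number of distinct haplotypes recovered in a sample of size n by repeated random sampling, and then extrapolates linearly to a larger n until a target fraction of haplotypes is recovered. The function names, the 0.95 target, the number of permutations, and the update rule are all illustrative assumptions.

```python
import math
import random


def mean_haplotypes_observed(probs, n, perms, rng):
    """Average number of distinct haplotypes seen in n random draws.

    probs : haplotype frequencies (hypothetical input, should sum to 1)
    perms : number of simulated samples to average over
    """
    labels = range(len(probs))
    total = 0
    for _ in range(perms):
        sample = rng.choices(labels, weights=probs, k=n)
        total += len(set(sample))
    return total / perms


def estimate_sample_size(probs, n_start, target=0.95, perms=200,
                         max_iter=100, seed=0):
    """Iteratively grow n until the mean fraction of haplotypes
    recovered reaches `target`.

    The update n <- ceil(n * H_total / H_observed) is an assumed
    linear extrapolation chosen for this sketch.
    Returns (n, mean number of haplotypes observed at that n).
    """
    rng = random.Random(seed)
    h_total = len(probs)
    n = n_start
    h_obs = 0.0
    for _ in range(max_iter):
        h_obs = mean_haplotypes_observed(probs, n, perms, rng)
        if h_obs / h_total >= target:
            break
        # Always increase n by at least one so the loop makes progress.
        n = max(n + 1, math.ceil(n * h_total / h_obs))
    return n, h_obs


# Hypothetical species with 10 haplotypes: uniform vs. strongly skewed.
uniform = [0.1] * 10
skewed = [0.91] + [0.01] * 9

print(estimate_sample_size(uniform, n_start=10))
print(estimate_sample_size(skewed, n_start=10))
```

Under these assumptions the skewed distribution needs a much larger sample than the uniform one, because rare haplotypes are seldom drawn. This is consistent with the abstract's remark that runtime depends on the initial sample size and on the skewness of the haplotype frequency distribution.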


Keywords:  Algorithm; DNA barcoding; Extrapolation; Iterative method; Sampling sufficiency; Species

Year:  2020        PMID: 33816897      PMCID: PMC7924493          DOI: 10.7717/peerj-cs.243

Source DB:  PubMed          Journal:  PeerJ Comput Sci        ISSN: 2376-5992


